
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.


Move `original_max_position_embeddings` to rope params (#42513)

* move `original_max_position_embeddings` to rope param dict and resolve TODOs from Joao

* bring back truncate in yarn

* move the patch to the `standardize` helper, which gets called every time we init the rope compute fn

* my bad

* silly typo, I should read the code I write!

* force the tester to use specific layer types, because rope is built with these types

* revert, why/how did it get deleted?!

* factor isn't guaranteed to exist in the end

* tiny test issue, needs to standardize first
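
For illustration only, the move described above might look roughly like the sketch below. It works on plain dicts, not the real config classes; the helper name `standardize_rope_params_sketch` and the `rope_parameters` key are assumptions made for this example and are not taken from the commit.

```python
def standardize_rope_params_sketch(config: dict) -> dict:
    """Hypothetical helper: return a copy of `config` in which a top-level
    `original_max_position_embeddings` is moved into the rope param dict."""
    config = dict(config)  # shallow copy, the caller's dict is left untouched
    # Assumed key name for the rope param dict; may differ in the real code.
    rope = dict(config.get("rope_parameters") or {})
    if "original_max_position_embeddings" in config:
        value = config.pop("original_max_position_embeddings")
        # A value already stored in the rope dict takes precedence.
        rope.setdefault("original_max_position_embeddings", value)
    config["rope_parameters"] = rope
    return config


legacy = {
    "original_max_position_embeddings": 4096,
    "rope_parameters": {"rope_type": "yarn"},  # note: no "factor" key here
}
standardized = standardize_rope_params_sketch(legacy)
# Read optional keys defensively, since `factor` isn't guaranteed to exist.
factor = standardized["rope_parameters"].get("factor")
```

Because the sketch is idempotent, calling it on every rope init would be harmless under these assumptions.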
Raushan Turganbay committed
347328108aa79959eda0e2eae0695c12f0101046
Parent: 9971e41
Committed by GitHub <noreply@github.com> on 12/23/2025, 6:01:47 AM