🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Move `original_max_position_embeddings` to rope params (#42513)
* move `original_max_position_embeddings` to rope param dict and resolve TODs from Joao * bring back truncate in yarn * move the patch to `standardize` helper, this one gets called every time we init rope comute fn * my bad * silly typo, I should read the code I write! * force the tester to use specific layer types, because rope is built with these types * revert, whyhow did it get deleted?! * factor isn't guaranteed to exist in the end * tiny test issue, needs to standardize first
R
Raushan Turganbay committed
347328108aa79959eda0e2eae0695c12f0101046
Parent: 9971e41
Committed by GitHub <noreply@github.com>
on 12/23/2025, 6:01:47 AM