🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Bugfix/alexsherstinsky/fix none check for attention factor in rope scaling 2024 08 28 0 (#33188)
* Fixing a bug in the way "attention_factor" is validated in ROPE utilities. * Fixing a bug in the way "attention_factor" is validated in ROPE utilities. * Fixing a bug in the way "attention_factor" is validated in ROPE utilities.
A
Alex Sherstinsky committed
122ded0a11cb6cc36f1bdab77bd2fd91185768fc
Parent: 178cb6b
Committed by GitHub <noreply@github.com>
on 9/4/2024, 3:01:12 PM