🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
🚨Early-error🚨 config will error out if `output_attentions=True` and the attn implementation is wrong (#38288)
* Protect ParallelInterface * early error out on output attention setting for no wraning in modeling * modular update * fixup * update model tests * update * oups * set model's config * more cases * ?? * properly fix * fixup * update * last onces * update * fix? * fix wrong merge commit * fix hub test * nits * wow I am tired * updates * fix pipeline! --------- Co-authored-by: Lysandre <hi@lysand.re>
A
Arthur committed
f5d45d89c4f1d3282fc216a96dc6830b88d41249
Parent: 896833c
Committed by GitHub <noreply@github.com>
on 5/23/2025, 3:17:38 PM