SIGN IN SIGN UP

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

0 0 184 Python

🚨Early-error🚨 config will error out if `output_attentions=True` and the attn implementation is wrong (#38288)

* Protect ParallelInterface

* early error out on output attention setting for no wraning in modeling

* modular update

* fixup

* update model tests

* update

* oups

* set model's config

* more cases

* ??

* properly fix

* fixup

* update

* last onces

* update

* fix?

* fix wrong merge commit

* fix hub test

* nits

* wow I am tired

* updates

* fix pipeline!

---------

Co-authored-by: Lysandre <hi@lysand.re>
A
Arthur committed
f5d45d89c4f1d3282fc216a96dc6830b88d41249
Parent: 896833c
Committed by GitHub <noreply@github.com> on 5/23/2025, 3:17:38 PM