🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Single config attribute for weight tying (#42815)
* merge two attr into one * delete tie encoder decoder * one more * mt5 * skip tests when tying is hardcoded * change test value to True, so we don't have to adjust hardcoded configs * awful decision in t5 to support two variants * delete my comment * not copied anymore * skip * they all had a shared embedding which was hardcoded, force it * force it in umt5 also, my model won't work otherwise :( * skip the key * dont't force if official weights set it to True * skip one test and fix teh other
R
Raushan Turganbay committed
9a90500b0b7a3f481c005a1398ab50539146dfbc
Parent: 7017994
Committed by GitHub <noreply@github.com>
on 12/22/2025, 5:25:07 AM