🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Fix many HPU failures in the CI (#39066)
* more torch.hpu patches * increase top_k because it results in flaky behavior when Tempreture, TopP and TopK are used together, which ends up killing beams early. * remove temporal fix * fix scatter operation when input and src are the same * trigger * fix and reduce * skip finding batch size as it makes the hpu go loco * fix fsdp (yay all are passing) * fix checking equal nan values * style * remove models list * order * rename to cuda_extensions * Update src/transformers/trainer.py
I
Ilyas Moutawwakil committed
18e0cae207a38d2c430b5fa08f9597312d1c1ab3
Parent: bff964c
Committed by GitHub <[email protected]>
on 7/3/2025, 9:17:27 AM