🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Fix DeepSpeed compatibility with weight_norm (#30881) (#31018)
Jonny Li committed
476890e9aeb695c746efe093c6ab7440322c9077
Parent: aada568
Committed by GitHub
on 5/28/2024, 4:25:15 PM