🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
[Quantization] Add cutlass kernel for FP8 (#43304)
* add cutlass
* feedback

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Mohamed Mekkouri committed
2b7bc5968a80e4110dde5f9d9babdca3f0559c86
Parent: b207f38
Committed by GitHub <noreply@github.com>
on 1/28/2026, 10:44:29 AM