SIGN IN SIGN UP

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

0 0 0 Python

[Quantization] Add cutlass kernel for FP8 (#43304)

* add cutlass

* feedback

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
M
Mohamed Mekkouri committed
2b7bc5968a80e4110dde5f9d9babdca3f0559c86
Parent: b207f38
Committed by GitHub <noreply@github.com> on 1/28/2026, 10:44:29 AM