🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
COMMITS
/ docs/source/ko/llm_optims.md March 19, 2026
C
[generate] Never use `cache_position` anymore in generation (#44816)
Cyril Vallez committed
December 10, 2025
Q
Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel (#41567)
Qubitium-ModelCloud committed
August 22, 2025
C
⚠️⚠️ Use `dtype` instead of `torch_dtype` everywhere! (#39782)
Cyril Vallez committed
August 8, 2025
C
[core] Refactor the Cache logic to make it simpler and more general (#39797)
Cyril Vallez committed
April 3, 2025
J
[CI] green llama tests (#37244)
Joao Gante committed
December 20, 2024
W
FEAT : Adding VPTQ quantization method to HFQuantizer (#34770)
wejoncy committed
August 30, 2024
Y
🌐 [i18n-KO] Translated `llm_optims.md` to Korean (#32325)
Yijun Lee committed