🤗 Transformers: the model-definition framework for state-of-the-art machine learning models for text, vision, audio, and multimodal tasks, for both inference and training.
Commit history for docker/transformers-quantization-latest-gpu/Dockerfile

March 16, 2026
- Bump torchao >=0.15 and fix quantization CI (#44604) by Marc Sun

February 24, 2026
- Add Four Over Six quantization integration (#43970) by Jack Cook

February 12, 2026
- Fix docker files (#43946) by Yih-Dar

December 10, 2025
- Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel (#41567) by Qubitium-ModelCloud

November 4, 2025
- [FPQuant] MXFP8 and MXFP4 backwards support (#41897) by Andrei Panferov

November 3, 2025
- Fix `torchcodec` version in quantization docker file (#41988) by Yih-Dar

November 2, 2025
- Fix `autoawq[kernels]` installation in quantization docker file (#41978) by Yih-Dar

October 10, 2025
- Remove DISABLE_KERNEL_MAPPING flag (#41475) by Mohamed Mekkouri

October 1, 2025
- FP-Quant NVFP4 and Python 3.9 support (#39876) by Andrei Panferov

September 29, 2025
- Fix docker quantization (#41201) by Marc Sun

September 22, 2025
- Update quantization CI (#41068) by Marc Sun

August 4, 2025
- Fix quant docker for fp-quant (#39641) by Marc Sun

July 23, 2025
- FP-Quant support (#38696) by Andrei Panferov

July 8, 2025
- Add torchcodec in docstrings/tests for `datasets` 4.0 (#39156) by Quentin Lhoest

June 27, 2025
- Uninstalling Flash attention from quantization docker (#39078) by Mohamed Mekkouri

May 12, 2025
- uninstall `kernels` from docker images (#38083) by Yih-Dar

April 22, 2025
- Add AutoRound quantization support (#37393) by Wenhua Cheng

April 11, 2025
- Disable kernels for quantization (#37446) by Mohamed Mekkouri

March 20, 2025
- Support loading Quark quantized models in Transformers (#36372) by fxmarty-amd

March 13, 2025
- Upgrading torch version and cuda version in quantization docker (#36264) by Mohamed Mekkouri

February 17, 2025
- Add compressed tensor in quant dockerfile (#36239) by Marc Sun

February 13, 2025
- Efficient Inference Kernel for SpQR (#34976) by Elvir Crnčević

December 23, 2024
- HIGGS Quantization Support (#34997) by Andrei Panferov

December 20, 2024
- FEAT : Adding VPTQ quantization method to HFQuantizer (#34770) by wejoncy

November 28, 2024
- Fix docker CI : install autogptq from source (#35000) by Mohamed Mekkouri

November 27, 2024
- Fix : Add PEFT from source to CI docker (#34969) by Mohamed Mekkouri

November 25, 2024
- Upgrade torch version to 2.5 in dockerfile for quantization CI (#34924) by Mohamed Mekkouri
- [AWQ, CI] Bump AWQ version used in docker image (#34922) by Benjamin Bossan

October 24, 2024
- Drop support for Python 3.8 (#34314) by Yih-Dar

October 2, 2024
- [Quantization] Switch to optimum-quanto (#31732) by Marc Sun

May 24, 2024
- Quantization / TST: Fix remaining quantization tests (#31000) by Younes Belkada

May 20, 2024
- FIX / Quantization: Fix Dockerfile build (#30890) by Younes Belkada

May 16, 2024
- TST / Quantization: Reverting to torch==2.2.1 (#30866) by Younes Belkada

May 15, 2024
- Use `torch 2.3` for CI (#30837) by Yih-Dar

May 2, 2024
- Add HQQ quantization support (#29637) by mobicham

April 22, 2024
- [FEAT]: EETQ quantizer support (#30262) by zhong zhuang

April 9, 2024
- Fix quantization tests (#29914) by Marc Sun

March 15, 2024
- [Quantization] Quanto quantizer (#29023) by Marc Sun

March 5, 2024
- Exllama kernels support for AWQ models (#28634) by Ilyas Moutawwakil

February 28, 2024
- [CI] Quantization workflow (#29046) by Marc Sun
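
The commits above maintain the CUDA image that runs the quantization CI. As a rough sketch of what such a Dockerfile assembles — the base image, version pins, and exact package list below are illustrative assumptions, not the repository's actual file; only the backends and the torchao >=0.15 pin are taken from the commit titles:

```dockerfile
# Illustrative sketch only; the real docker/transformers-quantization-latest-gpu/Dockerfile differs.
FROM nvidia/cuda:12.4.1-cudnn-devel-ubuntu22.04

RUN apt-get update && apt-get install -y python3 python3-pip git \
    && rm -rf /var/lib/apt/lists/*

# Install Transformers from source so CI exercises the latest main branch.
RUN pip install git+https://github.com/huggingface/transformers

# Quantization backends referenced in the history above:
# torchao (>=0.15 per #44604), GPT-QModel (replacing AutoGPTQ/AutoAWQ per #41567),
# compressed-tensors (#36239), hqq (#29637), optimum-quanto (#31732).
RUN pip install "torchao>=0.15" gptqmodel compressed-tensors hqq optimum-quanto
```

The recurring "fix docker" commits reflect that each backend pins its own torch/CUDA range, so the image is rebuilt and re-pinned whenever torch or a backend releases.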