COMMITS
/ tools/quantize/quantize.cpp April 1, 2026
E
llama : refactor llama_model_quantize_params to expose a pure C interface (#20346)
Ed Addario committed
March 10, 2026
March 4, 2026
S
Fix locale-dependent float printing in GGUF metadata (#17331)
SamareshSingh committed
February 20, 2026
D
quantize : add --dry-run option (#19526)
ddh0 committed
February 8, 2026
D
llama-quantize : cleanup `--help` output (#19317)
ddh0 committed
January 31, 2026
E
quantize: add option --tensor-type-file to llama-quantize (#18572)
EugeoSynthesisThirtyTwo committed
December 31, 2025
A
quantize: prevent input/output file collision (#18451)
Anri Lombard committed
August 5, 2025
G
llama : add gpt-oss (#15091)
Georgi Gerganov committed
August 4, 2025
S
quantize : fix confusing error message if ftype is invalid (#15071)
Sigbjørn Skjæret committed
July 30, 2025
E
quantize : fix using combined imatrix GGUFs (multiple datasets) (#14973)
Ed Addario committed
July 19, 2025
C
imatrix : use GGUF to store importance matrices (#9400)
compilade committed
June 22, 2025
E
quantize : handle user-defined pruning of whole layers (blocks) (#13037)
Ed Addario committed
May 13, 2025
E
quantize : improve tensor-type pattern matching (#13033)
Ed Addario committed
May 2, 2025
D
llama : move end-user examples to tools directory (#13249)
Diego Devesa committed