Commits: tools/quantize/quantize.cpp - ggml-org/llama.cpp - Morph

SIGN IN SIGN UP

ggml-org / llama.cpp UNCLAIMED

LLM inference in C/C++

0 0 0 C++

COMMITS

/ tools/quantize/quantize.cpp

master

April 1, 2026

E

llama : refactor llama_model_quantize_params to expose a pure C interface (#20346)

Ed Addario committed 2d ago

March 10, 2026

D

llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770)

ddh0 committed 24d ago

March 4, 2026

S

Fix locale-dependent float printing in GGUF metadata (#17331)

SamareshSingh committed 1mo ago

February 20, 2026

D

quantize : add --dry-run option (#19526)

ddh0 committed 1mo ago

February 8, 2026

D

llama-quantize : cleanup `--help` output (#19317)

ddh0 committed 1mo ago

January 31, 2026

E

quantize: add option --tensor-type-file to llama-quantize (#18572)

EugeoSynthesisThirtyTwo committed 2mo ago

December 31, 2025

A

quantize: prevent input/output file collision (#18451)

Anri Lombard committed 3mo ago

August 5, 2025

G

llama : add gpt-oss (#15091)

Georgi Gerganov committed 8mo ago

August 4, 2025

S

quantize : fix confusing error message if ftype is invalid (#15071)

Sigbjørn Skjæret committed 8mo ago

July 30, 2025

E

quantize : fix using combined imatrix GGUFs (multiple datasets) (#14973)

Ed Addario committed 8mo ago

July 19, 2025

C

imatrix : use GGUF to store importance matrices (#9400)

compilade committed 8mo ago

June 22, 2025

E

quantize : handle user-defined pruning of whole layers (blocks) (#13037)

Ed Addario committed 9mo ago

May 13, 2025

E

quantize : improve tensor-type pattern matching (#13033)

Ed Addario committed 10mo ago

May 2, 2025

D

llama : move end-user examples to tools directory (#13249)

Diego Devesa committed 11mo ago