Commits: common/common.cpp - ggml-org/llama.cpp

ggml-org / llama.cpp

LLM inference in C/C++

COMMITS / common/common.cpp (branch: master)

April 2, 2026

tests: allow exporting graph ops from HF file without downloading weights (#21182)

Ruben Ortlam committed 18h ago

5803c8d

March 31, 2026

common : move up common_init() and fix Windows UTF-8 logs (#21176)

Adrien Gallouët committed 3d ago

41361c8

common: add bounds check in common_init_result::sampler to prevent segfault on failed model load (#21082)

mtmcp committed 3d ago

90aa83c

March 28, 2026

fix **/x glob matching (#21129)

Sigbjørn Skjæret committed 5d ago

6509718

common : add character class support to glob_match (#21111)

Sigbjørn Skjæret committed 5d ago

3a14a54

cli : add /glob command (#21084)

Sigbjørn Skjæret committed 6d ago

c46758d

March 18, 2026

llama : re-enable manual LoRA adapter free (#19983)

Pop Flamingo committed 16d ago

312cf03

March 6, 2026

Autoparser - complete refactoring of parser architecture (#18675)

Piotr Wilkin (ilintar) committed 27d ago

566059a

February 23, 2026

llama : remove write/read of output ids/logits/embeddings (#18862)

Daniel Bevenius committed 1mo ago

2b6dfe8

February 18, 2026

common : make small string helpers as inline functions (#19693)

Adrien Gallouët committed 1mo ago

a569bda

February 14, 2026

NetBSD build support (#19589)

iMil committed 1mo ago

badba89

llama : update LoRA API. + fix excessive graph reserves (#19280)

agent-enemy-2 committed 1mo ago

2d8015e

February 12, 2026

common : replace deprecated codecvt using parse_utf8_codepoint (#19517)

Adrien Gallouët committed 1mo ago

4ae1b75

February 11, 2026

common : remove unused token util functions (#19506)

Daniel Bevenius committed 1mo ago

3136a84

January 28, 2026

spec : add self‑speculative decoding (no draft model required) + refactor (#18471)

Sascha Rogmann committed 2mo ago

72d3b18

January 15, 2026

context : reserve new scheduler when graph topology changes (#18547)

Georgi Gerganov committed 2mo ago

39173bc

January 8, 2026

llama-fit-params: free memory target per device (#18679)

Johannes Gäßler committed 2mo ago

64848de

llama : add `use_direct_io` flag for model loading (#18166)

Julius Tischbein committed 2mo ago

2038101

January 4, 2026

sampling : add support for backend sampling (#17004)

Daniel Bevenius committed 2mo ago

d3dce4e

December 30, 2025

lora: count lora nodes in graph_max_nodes (#18469)

Xuan-Son Nguyen committed 3mo ago

cd78e57

December 29, 2025

common: fix return value check for setpriority (#18412)

o7si committed 3mo ago

daa242d

December 27, 2025

llama: fix magic number of 999 for GPU layers (#18266)

Johannes Gäßler committed 3mo ago

026d2ad

December 22, 2025

tool/ex/tests: consistently free ctx, then model (#18168)

Johannes Gäßler committed 3mo ago

147a521

December 17, 2025

common: clarify instructions for bug reports (#18134)

Johannes Gäßler committed 3mo ago

a2c199e

December 15, 2025

llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)

Johannes Gäßler committed 3mo ago

b1f3a6e

December 14, 2025

common : refactor common_sampler + grammar logic changes (#17937)

Georgi Gerganov committed 3mo ago

254098a

December 7, 2025

common : change --color to accept on/off/auto, default to auto (#17827)

Sigbjørn Skjæret committed 3mo ago

2257758

December 4, 2025

common: use native MultiByteToWideChar (#17738)

Adrien Gallouët committed 4mo ago

83c1171

December 3, 2025

ggml webgpu: add support for emscripten builds (#17184)

Reese Levine committed 4mo ago

7ca5991

December 2, 2025

server: add --media-path for local media files (#17697)

Xuan-Son Nguyen committed 4mo ago

13628d8

December 1, 2025

server: introduce API for serving / loading / unloading multiple models (#17470)

Xuan-Son Nguyen committed 4mo ago

ec18edf

November 25, 2025

llama: introduce support for model-embedded sampling parameters (#17120)

Aaron Teo committed 4mo ago

877566d

November 20, 2025

common : more accurate sampling timing (#17382)

Georgi Gerganov committed 4mo ago

196f508

November 14, 2025

mtmd: add mtmd_log_set (#17268)

Xuan-Son Nguyen committed 4mo ago

9b17d74

November 8, 2025

arg: add --cache-list argument to list cached models (#17073)

Xuan-Son Nguyen committed 4mo ago

aa3b7a9

October 6, 2025

llama : add --no-host to disable host buffers (#16310)

Gadflyii committed 5mo ago

3df2244

September 26, 2025

devops: add s390x & ppc64le CI (#15925)

Aaron Teo committed 6mo ago

624207e

September 25, 2025

llama : add support for qwen3 reranker (#15824)

Douglas Hanley committed 6mo ago

b5bd037

September 24, 2025

common : add missing chrono header for common.cpp (#16211)

Uilian Ries committed 6mo ago

152729f

August 30, 2025

llama: use FA + max. GPU layers by default (#15434)

Johannes Gäßler committed 7mo ago

e81b8e4

August 28, 2025

model : jina-embeddings-v3 support (#13693)

Sigbjørn Skjæret committed 7mo ago

84ab83c

August 22, 2025

llama : remove KV cache defragmentation logic (#15473)

Georgi Gerganov committed 7mo ago

9ebebef

August 21, 2025

common : fix incorrect print of non-ascii characters in the logging (#15466)

Jie Fu (傅杰) committed 7mo ago

2f3dbff

August 14, 2025

finetune: SGD optimizer, more CLI args (#13873)

Jonathan Graehl committed 7mo ago

5cdb27e

July 31, 2025

llama : allow other bufts when overriding to CPU, add --no-repack option (#14990)

Diego Devesa committed 8mo ago

d6818d0

July 19, 2025

imatrix : use GGUF to store importance matrices (#9400)

compilade committed 8mo ago

9008328

July 16, 2025

llama : add high-throughput mode (#14363)

Georgi Gerganov committed 8mo ago

225e7a1

server : pre-calculate EOG logit biases (#14721)

Georgi Gerganov committed 8mo ago

6ffd4e9

June 20, 2025

vocab : prevent tokenizer overflow (#14301)

Ruikai Peng committed 9mo ago

dd6e6d0

June 19, 2025

build : suppress gcc15 compile warnings (#14261)

fanyang committed 9mo ago

456af35