Commits: common/common.h - ggml-org/llama.cpp

ggml-org / llama.cpp UNCLAIMED

LLM inference in C/C++

0 0 0 C++

COMMITS

/ common/common.h

b8570

March 28, 2026

cli : add /glob command (#21084)

Sigbjørn Skjæret committed 22d ago

c46758d

server : add custom socket options to disable SO_REUSEPORT (#21056)

Adrien Gallouët committed 22d ago

5c1a7b8

March 27, 2026

server: add built-in tools backend support (#20898)

Xuan-Son Nguyen committed 23d ago

20197b6

March 19, 2026

common/parser: add proper reasoning tag prefill reading (#20424)

Piotr Wilkin (ilintar) committed 1mo ago

5e54d51

March 17, 2026

common/parser: add `--skip-chat-parsing` to force a pure content parser. (#20289)

Piotr Wilkin (ilintar) committed 1mo ago

d2ecd2d

March 12, 2026

test-backend-ops: allow loading tests from file and parsing model operators into file (#19896)

Ruben Ortlam committed 1mo ago

128142f

March 11, 2026

common : fix --n-cpu-moe, --cpu-moe for models with fused gate + up (#20416)

ddh0 committed 1mo ago

4a748b8

common/parser: handle reasoning budget (#20297)

Piotr Wilkin (ilintar) committed 1mo ago

acb7c79

March 8, 2026

llama: end-to-end tests (#19802)

Johannes Gäßler committed 1mo ago

a976ff0

March 6, 2026

Checkpoint every n tokens: squash (#20087)

Piotr Wilkin (ilintar) committed 1mo ago

f5ddcd1

webui: Agentic Loop + MCP Client with support for Tools, Resources and Prompts (#18655)

Aleksander Grygier committed 1mo ago

f6235a4

March 5, 2026

chore : correct typos [no ci] (#20041)

Marcel Petrick committed 1mo ago

92f7da0

February 27, 2026

server : support multiple model aliases via comma-separated --alias (#19926)

Pascal committed 1mo ago

2e7e638

February 23, 2026

llama : remove write/read of output ids/logits/embeddings (#18862)

Daniel Bevenius committed 1mo ago

2b6dfe8

February 18, 2026

common : make small string helpers as inline functions (#19693)

Adrien Gallouët committed 2mo ago

a569bda

February 16, 2026

common : inline functions (#18639)

Ivan Chikish committed 2mo ago

cceb1b4

February 11, 2026

common : remove unused token util functions (#19506)

Daniel Bevenius committed 2mo ago

3136a84

February 9, 2026

spec : remove check rate (#19377)

Sascha Rogmann committed 2mo ago

292f690

January 30, 2026

spec : add ngram-mod (#19164)

Georgi Gerganov committed 2mo ago

dabaa2e

January 28, 2026

spec : add self‑speculative decoding (no draft model required) + refactor (#18471)

Sascha Rogmann committed 2mo ago

72d3b18

llama : disable Direct IO by default (#19109)

Georgi Gerganov committed 2mo ago

c5c64f7

January 20, 2026

common, server : use the same User-Agent by default (#18957)

Adrien Gallouët committed 2mo ago

1c7cf94

cli : fix reasoning responses in CLI (#18961)

Xuan-Son Nguyen committed 2mo ago

2c1f199

January 15, 2026

llama : add adaptive-p sampler (#17927)

ddh0 committed 3mo ago

13f1e4a

January 12, 2026

server : add arg for disabling prompt caching (#18776)

Radoslav Gerganov committed 3mo ago

bcf7546

examples : add --kv-unified to batched example (#18774)

Daniel Bevenius committed 3mo ago

4150da9

January 8, 2026

llama-fit-params: free memory target per device (#18679)

Johannes Gäßler committed 3mo ago

64848de

llama : add `use_direct_io` flag for model loading (#18166)

Julius Tischbein committed 3mo ago

2038101

January 7, 2026

examples : add debug utility/example (#18464)

Daniel Bevenius committed 3mo ago

ffba4f2

January 4, 2026

sampling : add support for backend sampling (#17004)

Daniel Bevenius committed 3mo ago

d3dce4e

December 27, 2025

llama: fix magic number of 999 for GPU layers (#18266)

Johannes Gäßler committed 3mo ago

026d2ad

December 21, 2025

server: add auto-sleep after N seconds of idle (#18228)

Xuan-Son Nguyen committed 3mo ago

ddcb75d

December 17, 2025

server: (webui) add --webui-config (#18028)

Pascal committed 4mo ago

6ce3d85

December 15, 2025

llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)

Johannes Gäßler committed 4mo ago

b1f3a6e

December 14, 2025

common : refactor common_sampler + grammar logic changes (#17937)

Georgi Gerganov committed 4mo ago

254098a

December 10, 2025

cli: enable jinja by default (#17911)

Xuan-Son Nguyen committed 4mo ago

34a6d86

server: add presets (config) when using multiple models (#17859)

Pascal committed 4mo ago

f32ca51

cli: new CLI experience (#17824)

Xuan-Son Nguyen committed 4mo ago

6c21317

December 7, 2025

common : change --color to accept on/off/auto, default to auto (#17827)

Sigbjørn Skjæret committed 4mo ago

2257758

December 4, 2025

build : move _WIN32_WINNT definition to headers (#17736)

Adrien Gallouët committed 4mo ago

ef75a89

December 2, 2025

server: add --media-path for local media files (#17697)

Xuan-Son Nguyen committed 4mo ago

13628d8

December 1, 2025

server: introduce API for serving / loading / unloading multiple models (#17470)

Xuan-Son Nguyen committed 4mo ago

ec18edf

common: improve verbosity level definitions (#17630)

Xuan-Son Nguyen committed 4mo ago

7733409

November 25, 2025

llama: introduce support for model-embedded sampling parameters (#17120)

Aaron Teo committed 4mo ago

877566d

November 20, 2025

common : more accurate sampling timing (#17382)

Georgi Gerganov committed 4mo ago

196f508

November 10, 2025

batched-bench : add "separate text gen" mode (#17103)

Georgi Gerganov committed 5mo ago

f914544

November 8, 2025

arg: add --cache-list argument to list cached models (#17073)

Xuan-Son Nguyen committed 5mo ago

aa3b7a9

November 5, 2025

server : do not default to multiple slots with speculative decoding (#17017)

Georgi Gerganov committed 5mo ago

13b339b

November 3, 2025

mtmd: add --image-min/max-tokens (#16921)

Xuan-Son Nguyen committed 5mo ago

070ff4d

October 12, 2025

common : update presets (#16504)

Georgi Gerganov committed 6mo ago

4b2dae3