COMMITS
common/sampling.cpp
March 31, 2026
March 27, 2026
common : inhibit lazy grammar sampler while reasoning is active (#20970)
Aldehir Rojas committed
March 19, 2026
common/parser: add proper reasoning tag prefill reading (#20424)
Piotr Wilkin (ilintar) committed
March 11, 2026
common/parser: handle reasoning budget (#20297)
Piotr Wilkin (ilintar) committed
January 15, 2026
llama : add adaptive-p sampler (#17927)
ddh0 committed
context : reserve new scheduler when graph topology changes (#18547)
Georgi Gerganov committed
January 4, 2026
sampling : add support for backend sampling (#17004)
Daniel Bevenius committed
January 3, 2026
common/grammar : replace problematic backtracking regex `[\s\S]*` (#18342)
Aldehir Rojas committed
December 17, 2025
common : restore grammar-based rejection sampling (#18137)
Georgi Gerganov committed
December 14, 2025
common : refactor common_sampler + grammar logic changes (#17937)
Georgi Gerganov committed
November 20, 2025
common : more accurate sampling timing (#17382)
Georgi Gerganov committed
September 24, 2025
llama: print memory breakdown on exit (#15860)
Johannes Gäßler committed
August 31, 2025
sampling : optimize samplers by reusing bucket sort (#15665)
Georgi Gerganov committed
May 25, 2025
`server`: streaming of tool calls and thoughts when `--jinja` is on (#12379)
Olivier Chafik committed
May 7, 2025
May 5, 2025
March 28, 2025
llama: fix error on bad grammar (#12628)
Johannes Gäßler committed
March 5, 2025
March 4, 2025
ggml : portability fixes for VS 2017 (#12150)
mgroeber9110 committed
February 13, 2025
sampling: add Top-nσ sampler (#11223)
Vinesh Janarthanan committed
February 2, 2025
sampling : support for llguidance grammars (#10224)
Michał Moskal committed
January 30, 2025
January 12, 2025
llama : add `llama_vocab`, functions -> methods, naming (#11110)
Georgi Gerganov committed
December 16, 2024
sampling : refactor + optimize penalties sampler (#10803)
Georgi Gerganov committed
November 25, 2024
speculative : refactor and add a simpler example (#10362)
Georgi Gerganov committed
October 29, 2024
llama : remove Tail-Free sampling (#10071)
Georgi Gerganov committed
October 25, 2024
llama : add DRY sampler (#9702)
wwoodsTM committed
October 21, 2024
llama : default sampling changes + greedy update (#9897)
Georgi Gerganov committed
October 15, 2024
llama : add infill sampler (#9896)
Georgi Gerganov committed
sampling : add XTC sampler (#9742)
MaggotHATE committed
October 10, 2024
common : use common_ prefix for common library functions (#9805)
Diego Devesa committed
September 24, 2024
sampling : avoid expensive softmax during greedy sampling (#9605)
Georgi Gerganov committed
September 15, 2024
common : reimplement logging (#9418)
Georgi Gerganov committed
September 13, 2024
llama : llama_perf + option to disable timings during decode (#9355)
Georgi Gerganov committed
September 10, 2024
llama : move random seed generation to the samplers (#9398)
slaren committed
September 9, 2024
common : move arg parser code to `arg.cpp` (#9388)
Xuan Son Nguyen committed
September 7, 2024
llama : fix empty ring buffer push (#9358)
Georgi Gerganov committed
llama : refactor sampling v2 (#9294)
Georgi Gerganov committed
July 23, 2024
llama : move vocab, grammar and sampling into separate files (#8508)
Georgi Gerganov committed
July 8, 2024
common : preallocate sampling token data vector (#8363)
Kevin Wang committed
common : avoid unnecessary logits fetch (#8358)
Kevin Wang committed
June 25, 2024
llama : return nullptr from llama_grammar_init (#8093)
Daniel Bevenius committed
May 22, 2024
common : normalize naming style (#7462)
Georgi Gerganov committed
May 21, 2024
`grammars`: fix resampling logic regression (#7424)
Olivier Chafik committed
May 11, 2024
server: fix reported top tokens for temperature 0 (#7203)
Johannes Gäßler committed
May 7, 2024
server: fix incorrectly reported token probabilities (#7125)
Johannes Gäßler committed
April 29, 2024
sampling : use std::random_device{}() for default random seed (#6962)
David Renshaw committed
April 24, 2024
Server: fix seed for multiple slots (#6835)
Johannes Gäßler committed
March 24, 2024
sampling : deduplicated code for probability distribution access (#6240)
Minsoo Cheong committed