COMMITS
/ examples/lookup/lookup.cpp March 31, 2026
A
common : move up common_init() and fix Windows UTF-8 logs (#21176)
Adrien Gallouët committed
March 4, 2026
S
Fix locale-dependent float printing in GGUF metadata (#17331)
SamareshSingh committed
January 30, 2026
D
lookup, lookahead: fix crash when n_ctx not specified (#18729)
Daniele Pinna committed
January 28, 2026
S
spec : add self‑speculative decoding (no draft model required) + refactor (#18471)
Sascha Rogmann committed
December 14, 2025
G
common : refactor common_sampler + grammar logic changes (#17937)
Georgi Gerganov committed
June 6, 2025
G
llama : deprecate llama_kv_self_ API (#14030)
Georgi Gerganov committed
May 20, 2025
G
llama : remove llama_kv_cache_view API + remove deprecated (#13653)
Georgi Gerganov committed
March 13, 2025
G
llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)
Georgi Gerganov committed
January 12, 2025
G
llama : add `llama_vocab`, functions -> methods, naming (#11110)
Georgi Gerganov committed
January 3, 2025
G
llama : refactor `src/llama.cpp` (#10902)
Georgi Gerganov committed
November 25, 2024
G
speculative : refactor and add a simpler example (#10362)
Georgi Gerganov committed
October 18, 2024
X
llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)
Xuan Son Nguyen committed
October 10, 2024
D
common : use common_ prefix for common library functions (#9805)
Diego Devesa committed
September 15, 2024
G
common : reimplement logging (#9418)
Georgi Gerganov committed
September 13, 2024
G
llama : llama_perf + option to disable timings during decode (#9355)
Georgi Gerganov committed
September 9, 2024
X
common : move arg parser code to `arg.cpp` (#9388)
Xuan Son Nguyen committed
September 7, 2024
X
common : refactor arg parser (#9308)
Xuan Son Nguyen committed
G
llama : refactor sampling v2 (#9294)
Georgi Gerganov committed
August 5, 2024
L
common : Changed tuple to struct (TODO fix) (#8823)
Liu Jia committed
July 17, 2024
J
lookup: fibonacci hashing, fix crashes (#8548)
Johannes Gäßler committed
June 4, 2024
G
common : refactor cli arg parsing (#7675)
Georgi Gerganov committed
May 22, 2024
G
common : normalize naming style (#7462)
Georgi Gerganov committed
April 24, 2024
J
Server: fix seed for multiple slots (#6835)
Johannes Gäßler committed
April 21, 2024
P
llama : support Llama 3 HF conversion (#6745)
Pedro Cuenca committed
April 9, 2024
J
BERT tokenizer fixes (#6498)
Jared Van Bortel committed
March 23, 2024
J
lookup: complement data from context with general text statistics (#5479)
Johannes Gäßler committed
February 16, 2024
B
ggml : add numa options (#5377)
bmwl committed
February 11, 2024
J
lookup: add print for drafting performance (#5450)
Johannes Gäßler committed
December 22, 2023
L
lookup : add prompt lookup decoding example (#4484)
LeonEricsson committed