COMMITS
/ examples/simple/simple.cpp March 4, 2026
S
Fix locale-dependent float printing in GGUF metadata (#17331)
SamareshSingh committed
September 17, 2025
J
examples : support encoder-decoder models in the simple example (#16002)
Jie Fu (傅杰) committed
May 19, 2025
N
fix: check model pointer validity before use (#13631)
Nick committed
January 12, 2025
G
llama : add `llama_vocab`, functions -> methods, naming (#11110)
Georgi Gerganov committed
January 6, 2025
G
llama : update llama_model API names (#11063)
Georgi Gerganov committed
November 25, 2024
D
ggml : add support for dynamic loading of backends (#10469)
Diego Devesa committed
October 18, 2024
X
llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)
Xuan Son Nguyen committed
October 10, 2024
D
examples : do not use common library in simple example (#9803)
Diego Devesa committed
September 15, 2024
G
common : reimplement logging (#9418)
Georgi Gerganov committed
September 13, 2024
G
llama : llama_perf + option to disable timings during decode (#9355)
Georgi Gerganov committed
September 9, 2024
X
common : move arg parser code to `arg.cpp` (#9388)
Xuan Son Nguyen committed
S
llama : minor sampling refactor (2) (#9386)
slaren committed
September 7, 2024
X
common : refactor arg parser (#9308)
Xuan Son Nguyen committed
G
llama : refactor sampling v2 (#9294)
Georgi Gerganov committed
June 4, 2024
G
common : refactor cli arg parsing (#7675)
Georgi Gerganov committed
April 21, 2024
P
llama : support Llama 3 HF conversion (#6745)
Pedro Cuenca committed
February 16, 2024
B
ggml : add numa options (#5377)
bmwl committed
December 4, 2023
D
simple : update error message for KV cache check (#4324)
Daniel Bevenius committed
October 27, 2023
T
simple : fix batch handling (#3803)
Thibault Terrasson committed
October 23, 2023
M
llama : remove token functions with `context` args in favor of `model` (#3720)
Marcus Dunn committed
October 18, 2023
G
speculative : add tree-based sampling example (#3624)
Georgi Gerganov committed
September 28, 2023
S
G
llama : custom attention mask + parallel decoding + no context swaps (#3228)
Georgi Gerganov committed
September 15, 2023
C
examples : add compiler version and target to build info (#2998)
Cebtenzzre committed
September 8, 2023
P
build : do not use _GNU_SOURCE gratuitously (#2035)
Przemysław Pawełczyk committed
August 27, 2023
G
llama : more tokenizer fixes (#2810)
Georgi Gerganov committed
August 21, 2023
G
gguf : new file format with flexible meta data (beta) (#2398)
Georgi Gerganov committed
August 4, 2023
B
build : fix several cast and printf warnings (#2499)
Borislav Stanimirov committed
July 10, 2023
E
mpi : add support for distributed inference via MPI (#2099)
Evan Miller committed
June 26, 2023
Z
ggml : add NUMA support (#1556)
zrm committed
June 24, 2023
D
llama : make model stateless and context stateful (llama_state) (#1797)
Didzis Gosko committed
June 16, 2023
S
examples : add "simple" (#1840)
SuperUserNameMan committed