COMMITS
/ src/llama-impl.h March 11, 2026
G
llama : enable chunked fused GDN path (#20340)
Georgi Gerganov committed
March 7, 2026
A
ggml: add GATED_DELTA_NET op (#19504)
Aman Gupta committed
February 11, 2026
D
llama : refactor sampling_info to use buffer_view template (#19368)
Daniel Bevenius committed
December 3, 2025
H
ggml, llama : use defaulted constructors/destructors (#17649)
Herman Semenoff committed
August 30, 2025
J
llama: use FA + max. GPU layers by default (#15434)
Johannes Gäßler committed
February 12, 2025
B
cleanup: fix compile warnings associated with gnu_printf (#11811)
bandoti committed
January 3, 2025
G
llama : refactor `src/llama.cpp` (#10902)
Georgi Gerganov committed
September 24, 2024
G
log : add CONT level for continuing previous log entry (#9610)
Georgi Gerganov committed
September 15, 2024
G
common : reimplement logging (#9418)
Georgi Gerganov committed
September 8, 2024
S
llama : refactor samplers internal implementation (#9370)
slaren committed
September 7, 2024
G
llama : refactor sampling v2 (#9294)
Georgi Gerganov committed
August 26, 2024
J
llama : fix time complexity of string replacement (#9163)
Justine Tunney committed
August 9, 2024
G
llama : better replace_all (cont) (#8926)
Georgi Gerganov committed
July 23, 2024
G
llama : move vocab, grammar and sampling into separate files (#8508)
Georgi Gerganov committed