COMMITS
/ examples/quantize-stats/quantize-stats.cpp
November 14, 2024
ggml : build backends as libraries (#10256)
Diego Devesa committed
October 8, 2024
ggml : fix BLAS with unsupported types (#9775)
Diego Devesa committed
September 7, 2024
llama : refactor sampling v2 (#9294)
Georgi Gerganov committed
July 12, 2024
ggml : minor naming changes (#8433)
Georgi Gerganov committed
April 29, 2024
Improve usability of --model-url & related flags (#6930)
Olivier Chafik committed
February 3, 2024
refactor : switch to emplace_back to avoid extra object (#5291)
Michael Klimenko committed
January 30, 2024
SOTA 3-bit quants (#5196)
Kawrakow committed
December 7, 2023
llama : per-layer KV cache + quantum K cache (#4309)
Georgi Gerganov committed
November 2, 2023
build : link against build info instead of compiling against it (#3879)
cebtenzzre committed
September 28, 2023
September 18, 2023
make : restore build-info.h dependency for several targets (#3205)
Cebtenzzre committed
September 15, 2023
examples : add compiler version and target to build info (#2998)
Cebtenzzre committed
check C++ code with -Wmissing-declarations (#3184)
Cebtenzzre committed
September 7, 2023
fix some warnings from gcc and clang-tidy (#3038)
Cebtenzzre committed
August 21, 2023
gguf : new file format with flexible meta data (beta) (#2398)
Georgi Gerganov committed
July 5, 2023
ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)
Stephan Walter committed
June 24, 2023
llama : make model stateless and context stateful (llama_state) (#1797)
Didzis Gosko committed
June 16, 2023
build : fix and ignore MSVC warnings (#1889)
Borislav Stanimirov committed
June 5, 2023
ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)
Kawrakow committed
May 17, 2023
Remove unused n_parts parameter (#1509)
Stephan Walter committed
May 1, 2023
Add git-based build information for better issue tracking (#1232)
DannyDaemonic committed
April 20, 2023
llama : multi-threaded quantization (#1075)
Kawrakow committed
April 17, 2023
quantize-stats : fix bug in --type argument
Georgi Gerganov committed
April 14, 2023
Expose type name from ggml (#970)
Pavol Rusnak committed
April 13, 2023
llama : merge llama_internal.h into llama.h
Georgi Gerganov committed
April 8, 2023
Rewrite loading code to try to satisfy everyone:
comex committed
April 7, 2023
Add quantize-stats command for testing quantization (#728)
unbounded committed