COMMITS
/ Makefile December 7, 2024
D
ggml : refactor online repacking (#10446)
Djip007 committed
December 3, 2024
X
server : (web ui) Various improvements, now use vite as bundler (#10599)
Xuan Son Nguyen committed
December 2, 2024
G
make : deprecate (#10514)
Georgi Gerganov committed
December 1, 2024
W
build: update Makefile comments for C++ version change (#10598)
Wang Qin committed
November 29, 2024
D
ggml : move AMX to the CPU backend (#10570)
Diego Devesa committed
November 26, 2024
T
Fix HIP flag inconsistency & build docs (#10524)
Tristan Druyen committed
R
mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (#10516)
R0CKSTAR committed
November 25, 2024
E
Introduce llama-run (#10291)
Eric Curtin committed
D
ggml : add support for dynamic loading of backends (#10469)
Diego Devesa committed
G
speculative : refactor and add a simpler example (#10362)
Georgi Gerganov committed
November 19, 2024
A
Fix missing file renames in Makefile due to changes in commit ae8de6d50a (#10413)
Anthony Van de Gejuchte committed
November 17, 2024
G
metal : refactor kernel args into structs (#10238)
Georgi Gerganov committed
J
CUDA: remove DMMV, consolidate F16 mult mat vec (#10318)
Johannes Gäßler committed
November 16, 2024
G
make : add ggml-opt (#0)
Georgi Gerganov committed
G
tests : remove test-grad0
Georgi Gerganov committed
G
make : auto-determine dependencies (#0)
Georgi Gerganov committed
November 15, 2024
S
ggml : fix some build issues
slaren committed
C
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921)
Charles Xu committed
November 14, 2024
D
ggml : build backends as libraries (#10256)
Diego Devesa committed
November 8, 2024
G
metal : opt-in compile flag for BF16 (#10218)
Georgi Gerganov committed
November 7, 2024
X
server : revamp chat UI with vuejs and daisyui (#10175)
Xuan Son Nguyen committed
November 3, 2024
D
ggml : move CPU backend to a separate file (#10144)
Diego Devesa committed
November 1, 2024
D
llama : add simple-chat example (#10124)
Diego Devesa committed
October 18, 2024
M
add amx kernel for gemm (#8998)
Ma Mingfei committed
October 2, 2024
D
ggml-backend : add device and backend reg interfaces (#9707)
Diego Devesa committed
G
examples : remove benchmark (#9704)
Georgi Gerganov committed
September 22, 2024
September 16, 2024
G
cmake : do not hide GGML options + rename option (#9465)
Georgi Gerganov committed
September 15, 2024
G
common : reimplement logging (#9418)
Georgi Gerganov committed
September 13, 2024
X
server : add loading html page while model is loading (#9468)
Xuan Son Nguyen committed
September 12, 2024
A
riscv : modify Makefile and add a RISCV_VECT to print log info (#9442)
Ahmad Tameem committed
September 10, 2024
S
make : do not run llama-gen-docs when building (#9399)
slaren committed
September 9, 2024
X
common : move arg parser code to `arg.cpp` (#9388)
Xuan Son Nguyen committed
September 7, 2024
X
common : refactor arg parser (#9308)
Xuan Son Nguyen committed
G
llama : refactor sampling v2 (#9294)
Georgi Gerganov committed
August 14, 2024
0
Vulkan Optimizations and Fixes (#8959)
0cc4m committed
August 9, 2024
G
make : fix llava obj file race (#8946)
Georgi Gerganov committed
T
llava : support MiniCPM-V-2.5 (#7599)
tc-mb committed
August 8, 2024
P
make : clean llamafile objects (#8923)
Pablo Duboue committed
August 7, 2024
S
make : use C compiler to build metal embed object (#8899)
slaren committed
July 31, 2024
C
Build: Fix potential race condition (#8781)
Clint Herron committed
July 27, 2024
R
feat: Support Moore Threads GPU (#8383)
R0CKSTAR committed
S
ggml : reduce hash table reset cost (#8698)
slaren committed
July 25, 2024
X
examples : remove `finetune` and `train-text-from-scratch` (#8669)
Xuan Son Nguyen committed
July 23, 2024
X
examples : Fix `llama-export-lora` example (#8607)
Xuan Son Nguyen committed
G
llama : move vocab, grammar and sampling into separate files (#8508)
Georgi Gerganov committed
July 16, 2024
J
make/cmake: add missing force MMQ/cuBLAS for HIP (#8515)
Johannes Gäßler committed
July 13, 2024
B
vulkan : cmake integration (#8119)
bandoti committed
July 11, 2024
N
ggml : add NVPL BLAS support (#8329) (#8425)
Nicholai Tukanov committed
July 10, 2024
C
Name Migration: Build the deprecation-warning 'main' binary every time (#8404)
Clint Herron committed