Commits: Makefile - ggml-org/llama.cpp

ggml-org / llama.cpp UNCLAIMED

LLM inference in C/C++

0 0 0 C++

COMMITS

/ Makefile

master

August 20, 2025

make : remove make in favor of CMake (#15449)

Daniel Bevenius committed 7mo ago

37f10f9

June 9, 2025

ggml-cpu : split arch-specific implementations (#13892)

xctan committed 9mo ago

f470bc3

May 7, 2025

examples : remove infill (#13283)

Georgi Gerganov committed 11mo ago

4773d7a

May 5, 2025

mtmd : rename llava directory to mtmd (#13311)

Xuan-Son Nguyen committed 11mo ago

9b61acf

May 2, 2025

llama : move end-user examples to tools directory (#13249)

Diego Devesa committed 11mo ago

1d36b36

April 15, 2025

CUDA/HIP: Share the same unified memory allocation logic. (#12934)

David Huang committed 11mo ago

84778e9

March 10, 2025

musa: support new arch mp_31 and update doc (#12296)

R0CKSTAR committed 1y ago

2513645

February 22, 2025

CUDA: app option to compile without FlashAttention (#12025)

Johannes Gäßler committed 1y ago

a28e0d5

February 21, 2025

MUSA: support ARM64 and enable dp4a .etc (#11843)

Bodhi committed 1y ago

0b3863f

February 18, 2025

tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)

Olivier Chafik committed 1y ago

63e489c

February 15, 2025

repo : update links to new url (#11886)

Georgi Gerganov committed 1y ago

68ff663

February 2, 2025

CUDA: use mma PTX instructions for FlashAttention (#11583)

Johannes Gäßler committed 1y ago

864a0b6

January 30, 2025

Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639)

Olivier Chafik committed 1y ago

8b576b6

January 21, 2025

Add Jinja template support (#11016)

Olivier Chafik committed 1y ago

6171c9d

December 14, 2024

llama : add Qwen2VL support + multimodal RoPE (#10361)

HimariO committed 1y ago

ba1cb19

December 7, 2024

ggml : refactor online repacking (#10446)

Djip007 committed 1y ago

19d8762

December 3, 2024

server : (web ui) Various improvements, now use vite as bundler (#10599)

Xuan Son Nguyen committed 1y ago

91c36c2

December 2, 2024

make : deprecate (#10514)

Georgi Gerganov committed 1y ago

8648c52

December 1, 2024

build: update Makefile comments for C++ version change (#10598)

Wang Qin committed 1y ago

43957ef

November 29, 2024

ggml : move AMX to the CPU backend (#10570)

Diego Devesa committed 1y ago

7cc2d2c

November 26, 2024

Fix HIP flag inconsistency & build docs (#10524)

Tristan Druyen committed 1y ago

be0e350

mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (#10516)

R0CKSTAR committed 1y ago

249cd93

November 25, 2024

Introduce llama-run (#10291)

Eric Curtin committed 1y ago

0cc6375

ggml : add support for dynamic loading of backends (#10469)

Diego Devesa committed 1y ago

5931c1f

speculative : refactor and add a simpler example (#10362)

Georgi Gerganov committed 1y ago

d9d54e4

November 19, 2024

Fix missing file renames in Makefile due to changes in commit ae8de6d50a (#10413)

Anthony Van de Gejuchte committed 1y ago

3952a22

November 17, 2024

metal : refactor kernel args into structs (#10238)

Georgi Gerganov committed 1y ago

cf32a9b

CUDA: remove DMMV, consolidate F16 mult mat vec (#10318)

Johannes Gäßler committed 1y ago

c3ea58a

November 16, 2024

make : add ggml-opt (#0)

Georgi Gerganov committed 1y ago

a4200ca

tests : remove test-grad0

Georgi Gerganov committed 1y ago

84274a1

make : auto-determine dependencies (#0)

Georgi Gerganov committed 1y ago

8ee0d09

November 15, 2024

ggml : fix some build issues

slaren committed 1y ago

883d206

backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921)

Charles Xu committed 1y ago

1607a5e

November 14, 2024

ggml : build backends as libraries (#10256)

Diego Devesa committed 1y ago

ae8de6d

November 8, 2024

metal : opt-in compile flag for BF16 (#10218)

Georgi Gerganov committed 1y ago

ec450d3

November 7, 2024

server : revamp chat UI with vuejs and daisyui (#10175)

Xuan Son Nguyen committed 1y ago

a71d81c

November 3, 2024

ggml : move CPU backend to a separate file (#10144)

Diego Devesa committed 1y ago

9f40989

November 1, 2024

llama : add simple-chat example (#10124)

Diego Devesa committed 1y ago

a6744e4

October 18, 2024

add amx kernel for gemm (#8998)

Ma Mingfei committed 1y ago

60ce97c

October 2, 2024

ggml-backend : add device and backend reg interfaces (#9707)

Diego Devesa committed 1y ago

c83ad6d

examples : remove benchmark (#9704)

Georgi Gerganov committed 1y ago

148844f

September 22, 2024

musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)

R0CKSTAR committed 1y ago

c35e586

September 16, 2024

cmake : do not hide GGML options + rename option (#9465)

Georgi Gerganov committed 1y ago

19514d6

September 15, 2024

common : reimplement logging (#9418)

Georgi Gerganov committed 1y ago

6262d13

September 13, 2024

server : add loading html page while model is loading (#9468)

Xuan Son Nguyen committed 1y ago

feff4aa

September 12, 2024

riscv : modify Makefile and add a RISCV_VECT to print log info (#9442)

Ahmad Tameem committed 1y ago

2b00fa7

September 10, 2024

make : do not run llama-gen-docs when building (#9399)

slaren committed 1y ago

fb3f249

September 9, 2024

common : move arg parser code to `arg.cpp` (#9388)

Xuan Son Nguyen committed 1y ago

bfe76d4

September 7, 2024

common : refactor arg parser (#9308)

Xuan Son Nguyen committed 1y ago

1b9ae51

llama : refactor sampling v2 (#9294)

Georgi Gerganov committed 1y ago

df270ef