Commits: examples/simple/simple.cpp - ggml-org/llama.cpp

ggml-org / llama.cpp

LLM inference in C/C++

C++

COMMITS

/ examples/simple/simple.cpp

master

March 4, 2026

Fix locale-dependent float printing in GGUF metadata (#17331)

SamareshSingh committed 1mo ago

September 17, 2025

examples : support encoder-decoder models in the simple example (#16002)

Jie Fu (傅杰) committed 6mo ago

May 19, 2025

fix: check model pointer validity before use (#13631)

Nick committed 10mo ago

January 12, 2025

llama : add `llama_vocab`, functions -> methods, naming (#11110)

Georgi Gerganov committed 1y ago

January 6, 2025

llama : update llama_model API names (#11063)

Georgi Gerganov committed 1y ago

November 25, 2024

ggml : add support for dynamic loading of backends (#10469)

Diego Devesa committed 1y ago

October 18, 2024

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)

Xuan Son Nguyen committed 1y ago

October 10, 2024

examples : do not use common library in simple example (#9803)

Diego Devesa committed 1y ago

September 15, 2024

common : reimplement logging (#9418)

Georgi Gerganov committed 1y ago

September 13, 2024

llama : llama_perf + option to disable timings during decode (#9355)

Georgi Gerganov committed 1y ago

September 9, 2024

common : move arg parser code to `arg.cpp` (#9388)

Xuan Son Nguyen committed 1y ago

llama : minor sampling refactor (2) (#9386)

slaren committed 1y ago

September 7, 2024

common : refactor arg parser (#9308)

Xuan Son Nguyen committed 1y ago

llama : refactor sampling v2 (#9294)

Georgi Gerganov committed 1y ago

June 4, 2024

common : refactor cli arg parsing (#7675)

Georgi Gerganov committed 1y ago

April 21, 2024

llama : support Llama 3 HF conversion (#6745)

Pedro Cuenca committed 1y ago

February 16, 2024

ggml : add numa options (#5377)

bmwl committed 2y ago

December 4, 2023

simple : update error message for KV cache check (#4324)

Daniel Bevenius committed 2y ago

October 27, 2023

simple : fix batch handling (#3803)

Thibault Terrasson committed 2y ago

October 23, 2023

llama : remove token functions with `context` args in favor of `model` (#3720)

Marcus Dunn committed 2y ago

October 18, 2023

speculative : add tree-based sampling example (#3624)

Georgi Gerganov committed 2y ago

September 28, 2023

llama.cpp : split llama_context_params into model and context params (#3301)

slaren committed 2y ago

llama : custom attention mask + parallel decoding + no context swaps (#3228)

Georgi Gerganov committed 2y ago

September 15, 2023

examples : add compiler version and target to build info (#2998)

Cebtenzzre committed 2y ago

September 8, 2023

build : do not use _GNU_SOURCE gratuitously (#2035)

Przemysław Pawełczyk committed 2y ago

August 27, 2023

llama : more tokenizer fixes (#2810)

Georgi Gerganov committed 2y ago

August 21, 2023

gguf : new file format with flexible meta data (beta) (#2398)

Georgi Gerganov committed 2y ago

August 4, 2023

build : fix several cast and printf warnings (#2499)

Borislav Stanimirov committed 2y ago

July 10, 2023

mpi : add support for distributed inference via MPI (#2099)

Evan Miller committed 2y ago

June 26, 2023

ggml : add NUMA support (#1556)

zrm committed 2y ago

June 24, 2023

llama : make model stateless and context stateful (llama_state) (#1797)

Didzis Gosko committed 2y ago

June 16, 2023

examples : add "simple" (#1840)

SuperUserNameMan committed 2y ago