Commits: examples/parallel/parallel.cpp - ggml-org/llama.cpp

ggml-org / llama.cpp

LLM inference in C/C++

master

March 31, 2026

common : move up common_init() and fix Windows UTF-8 logs (#21176)

Adrien Gallouët committed 3d ago

41361c8

March 4, 2026

Fix locale-dependent float printing in GGUF metadata (#17331)

SamareshSingh committed 1mo ago

cb8f4fa

December 14, 2025

common : refactor common_sampler + grammar logic changes (#17937)

Georgi Gerganov committed 3mo ago

254098a

July 18, 2025

parallel : add option for different RNG seeds (#14757)

Georgi Gerganov committed 8mo ago

2adf8d8

July 16, 2025

llama : add high-throughput mode (#14363)

Georgi Gerganov committed 8mo ago

225e7a1

June 6, 2025

llama : deprecate llama_kv_self_ API (#14030)

Georgi Gerganov committed 10mo ago

745aa53

June 1, 2025

parallel : fix n_junk == 0 (#13952)

Georgi Gerganov committed 10mo ago

c046217

May 31, 2025

llama : auto-batch preparation (#13845)

Georgi Gerganov committed 10mo ago

3f55f78

kv-cache : refactor + add llama_memory_state_i (#13746)

Georgi Gerganov committed 10mo ago

12d0188

May 30, 2025

parallel : increase the variability of the prompt lengths (#13927)

Georgi Gerganov committed 10mo ago

dd665cc

May 20, 2025

llama : remove llama_kv_cache_view API + remove deprecated (#13653)

Georgi Gerganov committed 10mo ago

a4090d1

May 17, 2025

parallel : add option for non-shared and larger prompts (#13598)

Georgi Gerganov committed 10mo ago

518329b

April 2, 2025

llama : refactor kv cache guard (#12695)

Georgi Gerganov committed 1y ago

a10b36c

April 1, 2025

common : refactor downloading system, handle mmproj with -hf option (#12694)

Xuan-Son Nguyen committed 1y ago

267c139

March 13, 2025

llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)

Georgi Gerganov committed 1y ago

e0dbec0

March 4, 2025

ggml : portability fixes for VS 2017 (#12150)

mgroeber9110 committed 1y ago

5bbe6a9

January 12, 2025

llama : add `llama_vocab`, functions -> methods, naming (#11110)

Georgi Gerganov committed 1y ago

afa8a9e

January 3, 2025

llama : refactor `src/llama.cpp` (#10902)

Georgi Gerganov committed 1y ago

f66f582

November 25, 2024

speculative : refactor and add a simpler example (#10362)

Georgi Gerganov committed 1y ago

d9d54e4

October 18, 2024

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)

Xuan Son Nguyen committed 1y ago

cda0e4b

October 10, 2024

common : use common_ prefix for common library functions (#9805)

Diego Devesa committed 1y ago

7eee341

September 15, 2024

common : reimplement logging (#9418)

Georgi Gerganov committed 1y ago

6262d13

September 13, 2024

llama : llama_perf + option to disable timings during decode (#9355)

Georgi Gerganov committed 1y ago

0abc6a2

September 9, 2024

common : move arg parser code to `arg.cpp` (#9388)

Xuan Son Nguyen committed 1y ago

bfe76d4

September 7, 2024

common : refactor arg parser (#9308)

Xuan Son Nguyen committed 1y ago

1b9ae51

llama : refactor sampling v2 (#9294)

Georgi Gerganov committed 1y ago

df270ef

August 5, 2024

common : Changed tuple to struct (TODO fix) (#8823)

Liu Jia committed 1y ago

0a4ce78

June 4, 2024

common : refactor cli arg parsing (#7675)

Georgi Gerganov committed 1y ago

1442677

May 22, 2024

common : normalize naming style (#7462)

Georgi Gerganov committed 1y ago

6ff1398

April 21, 2024

llama : support Llama 3 HF conversion (#6745)

Pedro Cuenca committed 1y ago

b97bc39

March 26, 2024

llama : greatly reduce output buffer memory usage (#6122)

compilade committed 2y ago

557410b

March 8, 2024

llama : support Mamba Selective State Space Models (#5328)

compilade committed 2y ago

c2101a2

February 16, 2024

ggml : add numa options (#5377)

bmwl committed 2y ago

f486f6e

November 23, 2023

llama : KV cache view API + better KV cache management (#4170)

Georgi Gerganov committed 2y ago

6b0a742

examples : fix typo in parallel example doc comment (#4181)

Daniel Bevenius committed 2y ago

9d5949f

November 2, 2023

build : link against build info instead of compiling against it (#3879)

cebtenzzre committed 2y ago

b12fa0d

October 23, 2023

llama : remove token functions with `context` args in favor of `model` (#3720)

Marcus Dunn committed 2y ago

5be6c80

October 20, 2023

sampling : refactor init to use llama_sampling_params (#3696)

Georgi Gerganov committed 2y ago

d1031cf

October 18, 2023

speculative : add tree-based sampling example (#3624)

Georgi Gerganov committed 2y ago

0e89203

October 11, 2023

common : fix mirostat state when using multiple sequences (#3543)

Kerfuffle committed 2y ago

70c29da

October 9, 2023

refact : fix convert script + zero out KV cache to avoid nans (#3523)

Georgi Gerganov committed 2y ago

fcca0a7

October 6, 2023

parallel : add option to load external prompt file (#3416)

pudepiedj committed 2y ago

a8777ad

October 3, 2023

llama : fix session saving/loading (#3400)

Georgi Gerganov committed 2y ago

ac2219f

September 28, 2023

llama.cpp : split llama_context_params into model and context params (#3301)

slaren committed 2y ago

16bc66d

llama : custom attention mask + parallel decoding + no context swaps (#3228)

Georgi Gerganov committed 2y ago

ec89379