Commits: examples/embedding/embedding.cpp - ggml-org/llama.cpp

ggml-org / llama.cpp UNCLAIMED

LLM inference in C/C++

0 0 0 C++

COMMITS

/ examples/embedding/embedding.cpp

master

March 31, 2026

common : move up common_init() and fix Windows UTF-8 logs (#21176)

Adrien Gallouët committed 4d ago

41361c8

March 4, 2026

Fix locale-dependent float printing in GGUF metadata (#17331)

SamareshSingh committed 1mo ago

cb8f4fa

January 5, 2026

model : add LFM2-ColBert-350M (#18607)

Tarek Dakhran committed 2mo ago

73d284a

December 14, 2025

common : refactor common_sampler + grammar logic changes (#17937)

Georgi Gerganov committed 3mo ago

254098a

November 28, 2025

ggml : add GGML_SCHED_NO_REALLOC option to disable reallocations in ggml_backend_sched (#17276)

Diego Devesa committed 4mo ago

e072b20

October 28, 2025

embedding: add raw option for --embd-output-format (#16541)

Sam Malayek committed 5mo ago

1c1409e

September 25, 2025

llama : add support for qwen3 reranker (#15824)

Douglas Hanley committed 6mo ago

b5bd037

July 30, 2025

tests : update for LLAMA_SET_ROWS=1 (#14961)

Georgi Gerganov committed 8mo ago

00131d6

July 16, 2025

llama : add high-throughput mode (#14363)

Georgi Gerganov committed 8mo ago

225e7a1

June 20, 2025

llama : improve sep token handling (#14272)

Sigbjørn Skjæret committed 9mo ago

88fc854

June 6, 2025

llama : deprecate llama_kv_self_ API (#14030)

Georgi Gerganov committed 10mo ago

745aa53

llama : support multiple classifier outputs and labels (#13940)

Sigbjørn Skjæret committed 10mo ago

d17a809

May 26, 2025

examples : allow extracting embeddings from decoder contexts (#13797)

Georgi Gerganov committed 10mo ago

79c137f

May 8, 2025

context : allow cache-less context for embeddings (#13108)

Georgi Gerganov committed 11mo ago

6562e5a

April 24, 2025

embeddings : fix batch sizes (#13076)

Georgi Gerganov committed 11mo ago

226251e

March 13, 2025

llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)

Georgi Gerganov committed 1y ago

e0dbec0

March 4, 2025

ggml : portability fixes for VS 2017 (#12150)

mgroeber9110 committed 1y ago

5bbe6a9

January 12, 2025

llama : add `llama_vocab`, functions -> methods, naming (#11110)

Georgi Gerganov committed 1y ago

afa8a9e

January 3, 2025

llama : refactor `src/llama.cpp` (#10902)

Georgi Gerganov committed 1y ago

f66f582

October 10, 2024

common : use common_ prefix for common library functions (#9805)

Diego Devesa committed 1y ago

7eee341

September 28, 2024

llama : add reranking support (#9510)

Georgi Gerganov committed 1y ago

f4d2b88

September 15, 2024

common : reimplement logging (#9418)

Georgi Gerganov committed 1y ago

6262d13

September 13, 2024

llama : llama_perf + option to disable timings during decode (#9355)

Georgi Gerganov committed 1y ago

0abc6a2

September 10, 2024

llama : move random seed generation to the samplers (#9398)

slaren committed 1y ago

49006c6

September 9, 2024

common : move arg parser code to `arg.cpp` (#9388)

Xuan Son Nguyen committed 1y ago

bfe76d4

September 7, 2024

common : refactor arg parser (#9308)

Xuan Son Nguyen committed 1y ago

1b9ae51

llama : refactor sampling v2 (#9294)

Georgi Gerganov committed 1y ago

df270ef

August 10, 2024

Add support for encoder-only T5 models (#8900)

fairydreaming committed 1y ago

7c3f55c

August 5, 2024

common : Changed tuple to struct (TODO fix) (#8823)

Liu Jia committed 1y ago

0a4ce78

June 24, 2024

embedding : more cli arguments (#7458)

Yann Follet committed 1y ago

646ef4a

June 21, 2024

llama : allow pooled embeddings on any model (#7477)

Douglas Hanley committed 1y ago

80ea089

June 4, 2024

common : refactor cli arg parsing (#7675)

Georgi Gerganov committed 1y ago

1442677

May 22, 2024

common : normalize naming style (#7462)

Georgi Gerganov committed 1y ago

6ff1398

May 15, 2024

embedding : free the batch after execution (#7297)

dm4 committed 1y ago

ea3b059

May 11, 2024

llama : add Jina Embeddings architecture (#6826)

Joan Fontanals committed 1y ago

b83cc3f

April 9, 2024

BERT tokenizer fixes (#6498)

Jared Van Bortel committed 2y ago

1b67731

March 27, 2024

embedding : show full embedding for single prompt (#6342)

howlger committed 2y ago

1e13987

March 26, 2024

embedding : adjust `n_ubatch` value (#6296)

Minsoo Cheong committed 2y ago

deb7240

March 14, 2024

embedding : add EOS token if not present (#899)

Georgi Gerganov committed 2y ago

044ec4b

embedding : print all resulting embeddings (#899)

Georgi Gerganov committed 2y ago

68265eb

embedding : print cosine similarity (#899)

Georgi Gerganov committed 2y ago

0fd6c1f

March 13, 2024

llama : add pipeline parallelism support (#6017)

slaren committed 2y ago

f30ea47

March 9, 2024

server : normalize embeddings (#5956)

SeungWon Jeong committed 2y ago

fb215c3

March 4, 2024

llama : fix embeddings (#5796)

Georgi Gerganov committed 2y ago

29ae62d

February 16, 2024

ggml : add numa options (#5377)

bmwl committed 2y ago

f486f6e

February 13, 2024

llama : support batched embeddings (#5466)

Douglas Hanley committed 2y ago

03bf161

February 11, 2024

Add support for BERT embedding models (#5423)

Douglas Hanley committed 2y ago

2891c8a

November 2, 2023

build : link against build info instead of compiling against it (#3879)

cebtenzzre committed 2y ago

b12fa0d

September 28, 2023

llama.cpp : split llama_context_params into model and context params (#3301)

slaren committed 2y ago

16bc66d

llama : custom attention mask + parallel decoding + no context swaps (#3228)

Georgi Gerganov committed 2y ago

ec89379