Commits: examples/perplexity/perplexity.cpp - ggml-org/llama.cpp

ggml-org / llama.cpp UNCLAIMED

LLM inference in C/C++

0 0 0 C++

COMMITS

/ examples/perplexity/perplexity.cpp

gguf-py

November 16, 2024

llama/ex: remove --logdir argument (#10339)

Johannes Gäßler committed 1y ago

4e54be0

October 18, 2024

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)

Xuan Son Nguyen committed 1y ago

cda0e4b

October 10, 2024

common : use common_ prefix for common library functions (#9805)

Diego Devesa committed 1y ago

7eee341

October 9, 2024

perplexity : fix integer overflow (#9783)

Georgi Gerganov committed 1y ago

e702206

September 23, 2024

perplexity : remove extra new lines after chunks (#9596)

Georgi Gerganov committed 1y ago

37f8c7b

September 20, 2024

perplexity : do not escape input data by default (#9548)

Sigbjørn Skjæret committed 1y ago

722ec1e

September 15, 2024

common : reimplement logging (#9418)

Georgi Gerganov committed 1y ago

6262d13

September 13, 2024

llama : llama_perf + option to disable timings during decode (#9355)

Georgi Gerganov committed 1y ago

0abc6a2

September 10, 2024

llama : move random seed generation to the samplers (#9398)

slaren committed 1y ago

49006c6

September 9, 2024

common : move arg parser code to `arg.cpp` (#9388)

Xuan Son Nguyen committed 1y ago

bfe76d4

September 7, 2024

common : refactor arg parser (#9308)

Xuan Son Nguyen committed 1y ago

1b9ae51

llama : refactor sampling v2 (#9294)

Georgi Gerganov committed 1y ago

df270ef

August 15, 2024

common : remove duplicate function llama_should_add_bos_token (#8778)

Zhenwei Jin committed 1y ago

4af8420

August 5, 2024

common : Changed tuple to struct (TODO fix) (#8823)

Liu Jia committed 1y ago

0a4ce78

July 3, 2024

ppl : fix n_seq_max for perplexity (#8277)

slaren committed 1y ago

5f2d4e6

June 12, 2024

`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)

Olivier Chafik committed 1y ago

1c641e6

June 4, 2024

common : refactor cli arg parsing (#7675)

Georgi Gerganov committed 1y ago

1442677

May 22, 2024

common : normalize naming style (#7462)

Georgi Gerganov committed 1y ago

6ff1398

May 18, 2024

perplexity : ndot progress and show stats with < 100 tasks (#7348)

strawberrymelonpanda committed 1y ago

ca57e0f

April 30, 2024

perplexity: more statistics, added documentation (#6936)

Johannes Gäßler committed 1y ago

a8f9b07

April 16, 2024

perplexity : require positive --ctx-size arg (#6695)

Georgi Gerganov committed 1y ago

58227ff

April 9, 2024

BERT tokenizer fixes (#6498)

Jared Van Bortel committed 2y ago

1b67731

March 26, 2024

llama : greatly reduce output buffer memory usage (#6122)

compilade committed 2y ago

557410b

March 13, 2024

llama : add pipeline parallelism support (#6017)

slaren committed 2y ago

f30ea47

March 11, 2024

llama : more consistent names of count variables (#5994)

Georgi Gerganov committed 2y ago

05b0621

March 9, 2024

perplexity : support using multiple sequences to allow larger batch sizes (#5946)

slaren committed 2y ago

d894f35

March 8, 2024

llama : support Mamba Selective State Space Models (#5328)

compilade committed 2y ago

c2101a2

February 18, 2024

ci : fix wikitext url + compile warnings (#5569)

Georgi Gerganov committed 2y ago

b1de968

ggml, common, examples, tests : fixed type arguments in printf (#5528)

Herman Semenov committed 2y ago

5d3de51

February 16, 2024

ggml : add numa options (#5377)

bmwl committed 2y ago

f486f6e

February 3, 2024

refactor : switch to emplace_back to avoid extra object (#5291)

Michael Klimenko committed 2y ago

52bb63c

February 2, 2024

perplexity : fix KL divergence calculations on Windows (#5273)

kalomaze committed 2y ago

1912211

January 23, 2024

Additional KL-divergence statistics (#5081)

Kawrakow committed 2y ago

44879ee

minor : clean-up some warnings and style (#5094)

Georgi Gerganov committed 2y ago

8975872

January 22, 2024

KL-divergence (#5076)

Kawrakow committed 2y ago

6f9939d

January 21, 2024

Add ability to evauate multiple choice tasks (#5047)

Kawrakow committed 2y ago

7dcbe39

January 20, 2024

perplexity : fix MSVC build after #5020 (#5043)

Jared Van Bortel committed 2y ago

97c1549

January 19, 2024

winogrande: evaluate log-probs in parallel (#5036)

Kawrakow committed 2y ago

7051aac

perplexity: avoid unnecessary alloocations and logit copies (#5035)

Kawrakow committed 2y ago

993fba8

perplexity : faster Winogrande via batching (#5024)

Georgi Gerganov committed 2y ago

8b20858

January 18, 2024

perplexity : fix winogrande N tasks option

Georgi Gerganov committed 2y ago

d391ae9

HellaSwag: speed up by parallelizing log-prob evaluation (#5020)

Kawrakow committed 2y ago

3e945cc

perplexity : faster HellaSwag via batching (#5017)

Georgi Gerganov committed 2y ago

ad19812

Add Winogrande evaluation (#5015)

Kawrakow committed 2y ago

682986a

January 16, 2024

perplexity : fix kv cache handling for hellaswag (#4981)

Georgi Gerganov committed 2y ago

959ef0c

November 17, 2023

Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)

Kerfuffle committed 2y ago

91f6499

November 2, 2023

build : link against build info instead of compiling against it (#3879)

cebtenzzre committed 2y ago

b12fa0d

October 29, 2023

Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843)

Kerfuffle committed 2y ago

6e08281

October 23, 2023

llama : remove token functions with `context` args in favor of `model` (#3720)

Marcus Dunn committed 2y ago

5be6c80

September 28, 2023

llama.cpp : split llama_context_params into model and context params (#3301)

slaren committed 2y ago

16bc66d