COMMITS
/ common/arg.cpp December 10, 2024
B
imatrix : Add imatrix to --no-context-shift (#10766)
Bartowski committed
Y
December 6, 2024
X
common : bring back --no-warmup to server (#10686)
Xuan Son Nguyen committed
December 2, 2024
X
llama : add enum for built-in chat templates (#10623)
Xuan Son Nguyen committed
November 28, 2024
J
common: fix warning message when no GPU found (#10564)
Johannes Gäßler committed
November 27, 2024
X
common : fix duplicated file name with hf_repo and hf_file (#10550)
Xuan Son Nguyen committed
November 25, 2024
D
llama : accept a list of devices to use to offload a model (#10497)
Diego Devesa committed
G
speculative : refactor and add a simpler example (#10362)
Georgi Gerganov committed
November 16, 2024
J
llama/ex: remove --logdir argument (#10339)
Johannes Gäßler committed
October 29, 2024
G
llama : remove Tail-Free sampling (#10071)
Georgi Gerganov committed
October 25, 2024
W
llama : add DRY sampler (#9702)
wwoodsTM committed
M
llama: string_split fix (#10022)
Michael Podvitskiy committed
October 22, 2024
D
arg : fix typo in embeddings argument help [no ci] (#9994)
Daniel Bevenius committed
October 21, 2024
D
arg : fix attention non-causal arg value hint (#9985)
Daniel Bevenius committed
October 15, 2024
M
sampling : add XTC sampler (#9742)
MaggotHATE committed
October 13, 2024
G
server : reuse cached context chunks (#9866)
Georgi Gerganov committed
October 12, 2024
G
server : remove self-extend features (#9860)
Georgi Gerganov committed
G
server : remove legacy system_prompt feature (#9857)
Georgi Gerganov committed
G
llama : improve infill support and special token detection (#9798)
Georgi Gerganov committed
October 10, 2024
D
common : use common_ prefix for common library functions (#9805)
Diego Devesa committed
D
rpc : add backend registry / device interfaces (#9812)
Diego Devesa committed
October 8, 2024
X
server : better security control for public deployments (#9776)
Xuan Son Nguyen committed
October 4, 2024
D
Fixed RNG seed docs (#9723)
Daniel Kleine committed
September 28, 2024
G
llama : add reranking support (#9510)
Georgi Gerganov committed
September 25, 2024
X
server : add more env vars, improve gen-docs (#9635)
Xuan Son Nguyen committed
September 23, 2024
X
server : add --no-context-shift option (#9607)
Xuan Son Nguyen committed
September 17, 2024
B
arg : add env variable for parallel (#9513)
Bert Wagner committed
September 16, 2024
V
main : option to disable context shift (#9484)
Vinesh Janarthanan committed
September 15, 2024
G
common : reimplement logging (#9418)
Georgi Gerganov committed
September 13, 2024
G
llama : llama_perf + option to disable timings during decode (#9355)
Georgi Gerganov committed
September 10, 2024
X
arg : bring back missing ifdef (#9411)
Xuan Son Nguyen committed
M
enable --special arg for llama-server (#9419)
matteo committed
S
llama : move random seed generation to the samplers (#9398)
slaren committed
September 9, 2024
X
common : move arg parser code to `arg.cpp` (#9388)
Xuan Son Nguyen committed