COMMITS
/ .github/workflows/server.yml June 1, 2026
G
speculative : fix n_outputs_max and remove draft-simple auto-enable (#23988)
Georgi Gerganov committed
E
ci: remove redundant or duplicate jobs (#23927)
Eve committed
May 28, 2026
G
ci : refactor (#23789)
Georgi Gerganov committed
May 27, 2026
G
common : fix env names to all have LLAMA_ARG_ prefix (#23778)
Georgi Gerganov committed
G
ci : fix windows ccaches (#23777)
Georgi Gerganov committed
G
ci : add ccache to server builds + fix undefined sanitizer build (#23763)
Georgi Gerganov committed
May 23, 2026
A
cmake/ui : refactor the build (#23352)
Aldehir Rojas committed
May 16, 2026
A
ui: Restructure repo to use `tools/ui` folder and `ui` / `UI` / `llama-ui` / `LLAMA_UI` naming (#23064)
Aleksander Grygier committed
May 14, 2026
A
webui: Move static build output from repo code to HF Bucket (#22937)
Aleksander Grygier committed
March 15, 2026
G
ci : split build.yml + server.yml (#20546)
Georgi Gerganov committed
February 8, 2026
S
ci : remove server job from webui and move slow test (#19424)
Sigbjørn Skjæret committed
February 7, 2026
G
ci : use -j param correctly when building with sanitizers (#19411)
Georgi Gerganov committed
February 3, 2026
G
ci : add sanitizer runs for server (#19291)
Georgi Gerganov committed
January 23, 2026
G
graph : utilize `ggml_build_forward_select()` to avoid reallocations (#18898)
Georgi Gerganov committed
January 21, 2026
P
ci : update GitHub Actions versions [no ci] (#18935)
Pádraic Slattery committed
January 14, 2026
A
refactor : remove libcurl, use OpenSSL when available (#18828)
Adrien Gallouët committed
January 4, 2026
D
sampling : add support for backend sampling (#17004)
Daniel Bevenius committed
December 16, 2025
S
ci : separate webui from server (#18072)
Sigbjørn Skjæret committed
November 22, 2025
A
ci : switch to BoringSSL on Server workflow (#17441)
Adrien Gallouët committed
November 21, 2025
A
ci : start using OpenSSL (#17235)
Adrien Gallouët committed
November 12, 2025
A
Update packages + upgrade Storybook to v10 (#17201)
Aleksander Grygier committed
September 17, 2025
A
SvelteKit-based WebUI (#14839)
Aleksander Grygier committed
June 4, 2025
D
ci : remove cuda 11.7 releases, switch runner to windows 2022 (#13997)
Diego Devesa committed
May 2, 2025
D
llama : move end-user examples to tools directory (#13249)
Diego Devesa committed
April 7, 2025
X
cmake : enable curl by default (#12761)
Xuan-Son Nguyen committed
March 3, 2025
D
ci : set GITHUB_ACTION env var for server tests (#12162)
Daniel Bevenius committed
February 6, 2025
X
server : (webui) migrate project to ReactJS with typescript (#11688)
Xuan-Son Nguyen committed
January 30, 2025
January 19, 2025
G
tests : increase timeout when sanitizers are enabled (#11300)
Georgi Gerganov committed
December 11, 2024
X
ci : pin nodejs to 22.11.0 (#10779)
Xuan Son Nguyen committed
December 3, 2024
X
server : (web ui) Various improvements, now use vite as bundler (#10599)
Xuan Son Nguyen committed
November 26, 2024
X
server : replace behave with pytest (#10416)
Xuan Son Nguyen committed
September 15, 2024
G
common : reimplement logging (#9418)
Georgi Gerganov committed
September 12, 2024
M
server : Add option to return token pieces in /tokenize endpoint (#9108)
Mathijs Henquet committed
June 26, 2024
G
llama : reorganize source code + improve CMake (#8006)
Georgi Gerganov committed
June 23, 2024
S
fix CI failures (#8066)
slaren committed
June 19, 2024
S
ggml : synchronize threads using barriers (#7993)
slaren committed
June 12, 2024
O
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
Olivier Chafik committed
June 10, 2024
S
ci : try win-2019 on server windows test (#7854)
slaren committed
May 20, 2024
G
server : fix temperature + disable some tests (#7409)
Georgi Gerganov committed
May 18, 2024
G
ci : re-enable sanitizer runs (#7358)
Georgi Gerganov committed
April 29, 2024
O
build(cmake): simplify instructions (`cmake -B build && cmake --build build ...`) (#6964)
Olivier Chafik committed
April 27, 2024
P
ci: server: tests python env on github container ubuntu latest / fix n_predict (#6935)
Pierrick Hymbert committed
April 26, 2024
P
ci: server: fix python installation (#6925)
Pierrick Hymbert committed
P
ci: server: fix python installation (#6922)
Pierrick Hymbert committed
P
ci: server: fix python installation (#6918)
Pierrick Hymbert committed
P
ci: fix concurrency for pull_request_target (#6917)
Pierrick Hymbert committed
April 22, 2024
P
ci: fix job are cancelling each other (#6781)
Pierrick Hymbert committed
April 4, 2024
M
ci: exempt master branch workflows from getting cancelled (#6486)
Minsoo Cheong committed
April 3, 2024
E
ci : update checkout, setup-python and upload-artifact to latest (#6456)
Ewout ter Hoeven committed