COMMITS
/ tools/server/server-task.h March 19, 2026
R
server: Add cached_tokens info to oaicompat responses (#19361)
Ryan Goulden committed
March 6, 2026
P
Autoparser - complete refactoring of parser architecture (#18675)
Piotr Wilkin (ilintar) committed
February 25, 2026
G
server : support multi-modal context checkpoints (#19849)
Georgi Gerganov committed
January 30, 2026
G
server : wrap around the "id_slot" parameter (#19207)
Georgi Gerganov committed
January 21, 2026
손
server: /v1/responses (partial) (#18486)
손희준 committed
January 20, 2026
X
cli : fix reasoning responses in CLI (#18961)
Xuan-Son Nguyen committed
January 19, 2026
X
server : refactor oai_parser_opt, move it to server_chat_params (#18937)
Xuan-Son Nguyen committed
January 15, 2026
X
server: improve slots scheduling for n_cmpl (#18789)
Xuan-Son Nguyen committed
G
context : reserve new scheduler when graph topology changes (#18547)
Georgi Gerganov committed
January 9, 2026
X
server: fix n_cmpl not skipping processing prompt (#18663)
Xuan-Son Nguyen committed
January 6, 2026
December 22, 2025
X
server: prevent data race from HTTP threads (#18263)
Xuan-Son Nguyen committed
December 10, 2025
X
cli: new CLI experience (#17824)
Xuan-Son Nguyen committed
December 8, 2025
X
server: delegate result_state creation to server_task (#17835)
Xuan-Son Nguyen committed
G
server : make cache_reuse configurable per request (#17858)
Georgi Gerganov committed
December 6, 2025
X
server: support multiple generations from one prompt (OAI "n" option) (#17775)
Xuan-Son Nguyen committed
December 4, 2025
X
server: move msg diffs tracking to HTTP thread (#17740)
Xuan-Son Nguyen committed
November 28, 2025
F
server : add Anthropic Messages API support (#17570)
Fredrik Hultin committed
November 24, 2025
X
server: split server.cpp code into server/common/task/queue (#17362)
Xuan-Son Nguyen committed