COMMITS
core/http/endpoints/openai/chat.go
May 25, 2026
feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802)
Richard Palethorpe committed
May 22, 2026
fix(openai): stream usage non-zero when tools are enabled (#9941)
LocalAI [bot] committed
May 14, 2026
fix(streaming): comply with OpenAI usage / stream_options spec (#9815)
LocalAI [bot] committed
May 8, 2026
chore: Security hardening (#9719)
Richard Palethorpe committed
April 28, 2026
feat: localai assistant chat modality (#9602)
Ettore Di Giacinto committed
April 21, 2026
fix(streaming): dedupe content, recover reasoning, unique tool_call IDs in deferred flush (#9470)
Ettore Di Giacinto committed
April 9, 2026
fix(streaming): deduplicate tool call emissions during streaming (#9292)
Ettore Di Giacinto committed
fix: thinking models with tools returning empty content (reasoning-only retry loop) (#9290)
Ettore Di Giacinto committed
April 6, 2026
fix(chat): do not retry if we had chatdeltas or tooldeltas from backend (#9244)
Ettore Di Giacinto committed
April 5, 2026
fix(autoparser): correctly pass by logprobs (#9239)
Ettore Di Giacinto committed
April 4, 2026
fix(reasoning): suppress partial tag tokens during autoparser warm-up
Ettore Di Giacinto committed
fix(reasoning): warm-up
Ettore Di Giacinto committed
fix(reasoning): accumulate and strip reasoning tags from autoparser results (#9227)
Ettore Di Giacinto committed
feat(api): Allow coding agents to interactively discover how to control and configure LocalAI (#9084)
Richard Palethorpe committed
feat(autoparser): prefer chat deltas from backends when emitted (#9224)
Ettore Di Giacinto committed
March 29, 2026
feat: add distributed mode (#9124)
Ettore Di Giacinto committed
March 21, 2026
feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092)
Ettore Di Giacinto committed
March 20, 2026
feat(ui, openai): Structured errors and link to traces in error toast (#9068)
Richard Palethorpe committed
March 16, 2026
chore: refactor endpoints to use same inferencing path, add automatic retrial mechanism in case of errors (#9029)
Ettore Di Giacinto committed
March 15, 2026
feat: support streaming mode for tool calls in agent mode, fix interleaved thinking stream (#9023)
Ettore Di Giacinto committed
March 11, 2026
feat(ui): MCP Apps, mcp streaming and client-side support (#8947)
Ettore Di Giacinto committed
March 8, 2026
feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838)
Ettore Di Giacinto committed
March 1, 2026
fix: retry when LLM returns empty messages (#8704)
Ettore Di Giacinto committed
February 2, 2026
fix(api): Add missing field in initial OpenAI streaming response (#8341)
Alex O'Connell committed
January 20, 2026
feat(openresponses): Support reasoning blocks (#8133)
Ettore Di Giacinto committed
fix(reasoning): support models with reasoning without starting thinking tag (#8132)
Ettore Di Giacinto committed
January 10, 2026
feat(api): support 'reasoning' api field (#7959)
Ettore Di Giacinto committed
January 5, 2026
feat(function): Add tool streaming, XML Tool Call Parsing Support (#7865)
Ettore Di Giacinto committed
December 21, 2025
chore(refactor): move logging to common package based on slog (#7668)
Ettore Di Giacinto committed
November 16, 2025
feat: add support to logitbias and logprobs (#7283)
Ettore Di Giacinto committed
November 14, 2025
feat: migrate to echo and enable cancellation of non-streaming requests (#7270)
Ettore Di Giacinto committed
chore(deps): bump llama.cpp to `c4abcb2457217198efdd67d02675f5fddb7071c2` (#7266)
Ettore Di Giacinto committed
November 9, 2025
fix(api): SSE streaming format to comply with specification (#7182)
Copilot committed
feat: respect context and add request cancellation (#7187)
Ettore Di Giacinto committed
November 7, 2025
feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120)
Ettore Di Giacinto committed
October 8, 2025
fix(grammars): handle empty parameters on object types (#6409)
Ettore Di Giacinto committed
October 5, 2025
fix: add strict mode check for no action function (#6294)
Ettore Di Giacinto committed
September 11, 2025
fix(chat): use proper finish_reason for tool/function calling (#6243)
Mário Freitas committed
August 28, 2025
chore: stream errors while streaming SSE (#6160)
Ettore Di Giacinto committed
August 14, 2025
feat(backends): add system backend, refactor (#6059)
Ettore Di Giacinto committed
August 6, 2025
feat: add reasoning effort and metadata to template (#5981)
Ettore Di Giacinto committed
July 24, 2025
fix: untangle pkg and core (#5896)
Dave committed
June 29, 2025
fix(gallery): automatically install model from name (#5757)
Ettore Di Giacinto committed
February 10, 2025
feat: Centralized Request Processing middleware (#3847)
Dave committed
February 6, 2025
fix(openai): consistently return stop reason (#4771)
Ettore Di Giacinto committed
January 18, 2025
chore(docs): extra-Usage and Machine-Tag docs (#4627)
mintyleaf committed
January 17, 2025
feat: add machine tag and inference timings (#4577)
mintyleaf committed
December 8, 2024
Revert "feat: include tokens usage for streamed output" (#4336)
Ettore Di Giacinto committed
feat(template): read jinja templates from gguf files (#4332)
Ettore Di Giacinto committed