COMMITS
/ docs/content/advanced/model-configuration.md May 29, 2026
L
feat(reasoning): honor per-request reasoning_effort on chat completions (#10082)
LocalAI [bot] committed
May 21, 2026
L
feat(llama-cpp): make server-side prompt cache work by default (#9925)
LocalAI [bot] committed
May 16, 2026
L
feat(llama-cpp): bump to MTP-merge SHA and automatically set MTP defaults (#9852)
LocalAI [bot] committed
May 14, 2026
L
feat(llama-cpp): expose 12 missing common_params via options[] (#9814)
LocalAI [bot] committed
May 12, 2026
L
feat(llama-cpp): bump to `1ec7ba0c`, adapt grpc-server, expose new spec-decoding options (#9765)
LocalAI [bot] committed
April 14, 2026
E
feat(backend): add turboquant llama.cpp-fork backend (#9355)
Ettore Di Giacinto committed
April 5, 2026
E
feat(llama.cpp): wire speculative decoding settings (#9238)
Ettore Di Giacinto committed
March 12, 2026
L
docs: Document GPU auto-fit mode limitations and trade-offs (closes #8562) (#8954)
LocalAI [bot] committed
March 5, 2026
E
feat: pass-by metadata to predict options (#8795)
Ettore Di Giacinto committed
January 29, 2026
R
feat(realtime): Add audio conversations (#6245)
Richard Palethorpe committed
January 20, 2026
E
feat(openresponses): Support reasoning blocks (#8133)
Ettore Di Giacinto committed
November 19, 2025
E
feat: docs revamp (#7313)
Ettore Di Giacinto committed