COMMITS
/ core/services/nodes/interfaces.go
May 30, 2026
feat: prefix-cache-aware routing for distributed mode (#10071)
LocalAI [bot] committed
May 25, 2026
fix(distributed): persist per-model load info so reconciler survives frontend restart (#9981)
LocalAI [bot] committed
May 5, 2026
feat(concurrency-groups): per-model exclusive groups for backend loading (#9662)
Ettore Di Giacinto committed
May 4, 2026
April 27, 2026
feat(distributed): support multiple replicas of one model on the same node (#9583)
Ettore Di Giacinto committed
April 8, 2026
fix(autoscaling): extract load model from Route() and use as well when doing autoscale (#9270)
Ettore Di Giacinto committed
April 1, 2026
feat: add resume endpoint to undrain nodes (#9197)
Ettore Di Giacinto committed
March 31, 2026
feat: add node reconciler, allow to schedule to group of nodes, min/max autoscaler (#9186)
Ettore Di Giacinto committed
March 29, 2026
feat: add distributed mode (#9124)
Ettore Di Giacinto committed