COMMITS
/ pkg/model/initializers.go May 5, 2026
E
feat(concurrency-groups): per-model exclusive groups for backend loading (#9662)
Ettore Di Giacinto committed
May 4, 2026
April 12, 2026
E
feat(backends): add ik-llama-cpp (#9326)
Ettore Di Giacinto committed
April 8, 2026
E
fix(nodes): better detection if nodes goes down or model is not available (#9274)
Ettore Di Giacinto committed
March 29, 2026
E
feat: add distributed mode (#9124)
Ettore Di Giacinto committed
March 13, 2026
L
Remove HuggingFace backend support (#8971)
LocalAI [bot] committed
February 1, 2026
December 25, 2025
E
feat: disable force eviction (#7725)
Ettore Di Giacinto committed
December 21, 2025
E
chore(refactor): move logging to common package based on slog (#7668)
Ettore Di Giacinto committed
December 16, 2025
E
fix: correctly propagate error during model load (#7610)
Ettore Di Giacinto committed
December 12, 2025
E
feat(loader): enhance single active backend to support LRU eviction (#7535)
Ettore Di Giacinto committed
December 11, 2025
E
fix: make sure to close on errors (#7521)
Ettore Di Giacinto committed
August 28, 2025
E
fix: register backends to model-loader during installation (#6159)
Ettore Di Giacinto committed
July 22, 2025
E
feat: refactor build process, drop embedded backends (#5875)
Ettore Di Giacinto committed
July 19, 2025
E
feat: split piper from main binary (#5858)
Ettore Di Giacinto committed
July 18, 2025
E
feat: do not bundle llama-cpp anymore (#5790)
Ettore Di Giacinto committed
June 15, 2025
E
feat: Add backend gallery (#5607)
Ettore Di Giacinto committed
June 12, 2025
K
Fix Typos in Comments and Error Messages (#5637)
kilavvy committed
April 25, 2025
E
feat(llama.cpp/clip): inject gpu options if we detect GPUs (#5243)
Ettore Di Giacinto committed
April 1, 2025
E
feat(loader): enhance single active backend by treating as singleton (#5107)
Ettore Di Giacinto committed
March 31, 2025
E
fix: race during stop of active backends (#5106)
Ettore Di Giacinto committed
February 17, 2025
B
fix: change initialization order of llama-cpp-avx512 to go before avx2 variant (#4837)
Bas Hulsken committed
February 10, 2025
D
feat: Centralized Request Processing middleware (#3847)
Dave committed
February 6, 2025
E
chore(llama-ggml): drop deprecated backend (#4775)
Ettore Di Giacinto committed
January 24, 2025
E
chore: detect and enable avx512 builds (#4675)
Ettore Di Giacinto committed
January 23, 2025
E
chore(refactor): group cpu cap detection (#4674)
Ettore Di Giacinto committed
E
feat(transformers): add support to Mamba (#4669)
Ettore Di Giacinto committed
January 22, 2025
E
chore(stablediffusion-ncn): drop in favor of ggml implementation (#4652)
Ettore Di Giacinto committed
January 18, 2025
G
chore: remove deprecated tinydream backend (#4631)
Gianluca Boiano committed
E
feat(transformers): merge sentencetransformers backend (#4624)
Ettore Di Giacinto committed
January 17, 2025
E
chore: alias transformers-musicgen to transformers (#4623)
Ettore Di Giacinto committed
November 27, 2024
E
feat(backends): Drop bert.cpp (#4272)
Ettore Di Giacinto committed
November 26, 2024
E
feat(models): use rwkv from llama.cpp (#4264)
Ettore Di Giacinto committed
November 8, 2024
E
chore(refactor): drop unnecessary code in loader (#4096)
Ettore Di Giacinto committed
October 31, 2024
E
fix(grpc): pass by modelpath (#4023)
Ettore Di Giacinto committed
October 15, 2024
E
fix(llama.cpp): consider also native builds (#3839)
Ettore Di Giacinto committed
October 11, 2024
E
fix(llama-cpp): consistently select fallback (#3789)
Ettore Di Giacinto committed
October 4, 2024
E
feat(multimodal): allow to template placeholders (#3728)
Ettore Di Giacinto committed
October 2, 2024
E
fix(initializer): correctly reap dangling processes (#3717)
Ettore Di Giacinto committed
E
feat: track internally started models by ID (#3693)
Ettore Di Giacinto committed
September 26, 2024
E
chore(refactor): track grpcProcess in the model structure (#3663)
Ettore Di Giacinto committed
September 25, 2024
E
feat(api): list loaded models in `/system` (#3661)
Ettore Di Giacinto committed
September 17, 2024
E
chore(refactor): drop duplicated shutdown logics (#3589)
Ettore Di Giacinto committed
September 5, 2024
E
feat: add endpoint to list system informations (#3449)
Ettore Di Giacinto committed
August 25, 2024
E
fix(model-loading): keep track of open GRPC Clients (#3377)
Ettore Di Giacinto committed
August 23, 2024
August 7, 2024
E
chore: drop gpt4all.cpp (#3106)
Ettore Di Giacinto committed
July 23, 2024
E
fix(cuda): downgrade to 12.0 to increase compatibility range (#2994)
Ettore Di Giacinto committed
July 1, 2024
E
fix(initializer): do select backends that exist (#2694)
Ettore Di Giacinto committed
E
feat(backend): fallback with autodetect (#2693)
Ettore Di Giacinto committed