Commits: core/backend/options.go - mudler/LocalAI

mudler / LocalAI

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

COMMITS

/ core/backend/options.go

master

May 30, 2026

feat: prefix-cache-aware routing for distributed mode (#10071)

LocalAI [bot] committed 2d ago

a44bdb2

May 25, 2026

feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802)

Richard Palethorpe committed 7d ago

6a80e23

May 23, 2026

fix(traces): cap backend trace Data to keep admin UI responsive (#9960)

LocalAI [bot] committed 9d ago

1198d10

May 22, 2026

feat(config): default prompt_cache_all to true (#9951)

LocalAI [bot] committed 10d ago

c500461

May 5, 2026

fix(backend): resolve relative draft_model paths against the models dir (#9680)

LocalAI [bot] committed 27d ago

70cf8ac

April 28, 2026

feat(vllm): expose AsyncEngineArgs via generic engine_args YAML map (#9563)

Richard Palethorpe committed 1mo ago

4916f8c

March 31, 2026

feat: add node reconciler, allow to schedule to group of nodes, min/max autoscaler (#9186)

Ettore Di Giacinto committed 2mo ago

8862e3c

March 29, 2026

feat: add distributed mode (#9124)

Ettore Di Giacinto committed 2mo ago

59108fb

March 21, 2026

feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092)

Ettore Di Giacinto committed 2mo ago

031a36c

March 18, 2026

feat(ui): Per model backend logs and various fixes (#9028)

Richard Palethorpe committed 2mo ago

35d509d

March 15, 2026

fix: Automatically disable mmap for Intel SYCL backends (#9012) (#9015)

LocalAI [bot] committed 2mo ago

c6a5128

March 5, 2026

feat: pass-by metadata to predict options (#8795)

Ettore Di Giacinto committed 2mo ago

580517f

January 2, 2026

fix(llama.cpp/mmproj): fix loading mmproj in nested sub-dirs different from model path (#7832)

Ettore Di Giacinto committed 5mo ago

5f6c941

December 21, 2025

chore(refactor): move logging to common package based on slog (#7668)

Ettore Di Giacinto committed 5mo ago

c37785b

November 16, 2025

feat: add support to logitbias and logprobs (#7283)

Ettore Di Giacinto committed 6mo ago

d7f9f3a

October 10, 2025

fix(llama.cpp): correctly set grammar triggers (#6432)

Ettore Di Giacinto committed 7mo ago

cd1e112

August 31, 2025

feat(flash_attention): set auto for flash_attention in llama.cpp (#6168)

Ettore Di Giacinto committed 9mo ago

739573e

August 14, 2025

feat(backends): add system backend, refactor (#6059)

Ettore Di Giacinto committed 9mo ago

089efe0

July 22, 2025

feat: refactor build process, drop embedded backends (#5875)

Ettore Di Giacinto committed 10mo ago

98e5291

June 28, 2025

feat(llama.cpp): allow to set kv-overrides (#5745)

Ettore Di Giacinto committed 11mo ago

dfadc36

May 22, 2025

feat(llama.cpp): add reranking (#5396)

Ettore Di Giacinto committed 1y ago

3b0cf52

May 3, 2025

chore(defaults): enlarge defaults, drop gpu layers which is infered (#5308)

Ettore Di Giacinto committed 1y ago

b2f9fc8

April 19, 2025

chore(autogptq): drop archived backend (#5214)

Ettore Di Giacinto committed 1y ago

61cc76c

April 1, 2025

feat(loader): enhance single active backend by treating as singleton (#5107)

Ettore Di Giacinto committed 1y ago

2c425e9

March 5, 2025

chore(deps): update llama.cpp and sync with upstream changes (#4950)

Ettore Di Giacinto committed 1y ago

67f7bff

February 18, 2025

feat(vllm): Additional vLLM config options (Disable logging, dtype, and Per-Prompt media limits) (#4855)

Brandon Beiler committed 1y ago

6a6e1a0

February 2, 2025

feat(llama.cpp): Add support to grammar triggers (#4733)

Ettore Di Giacinto committed 1y ago

1d6afbd

January 17, 2025

chore(vall-e-x): Drop backend (#4619)

Ettore Di Giacinto committed 1y ago

7d0ac1e

December 6, 2024

feat(llama.cpp): expose cache_type_k and cache_type_v for quant of kv cache (#4329)

Ettore Di Giacinto committed 1y ago

d4c1746

December 3, 2024

feat(backend): add stablediffusion-ggml (#4289)

Ettore Di Giacinto committed 1y ago

44a5dac

November 8, 2024

chore(refactor): drop unnecessary code in loader (#4096)

Ettore Di Giacinto committed 1y ago

6daef00

November 5, 2024

feat(diffusers): allow multiple lora adapters (#4081)

Ettore Di Giacinto committed 1y ago

947224b

October 23, 2024

feat(vllm): expose 'load_format' (#3943)

Ettore Di Giacinto committed 1y ago

ae1ec4e

October 2, 2024

feat: track internally started models by ID (#3693)

Ettore Di Giacinto committed 1y ago

0965c6c

September 22, 2024

feat: auto load into memory on startup (#3627)

Sertaç Özercan committed 1y ago

ee21b00

July 15, 2024

feat(llama.cpp): support embeddings endpoints (#2871)

Ettore Di Giacinto committed 1y ago

35561ed

June 26, 2024

feat(options): add `repeat_last_n` (#2660)

Ettore Di Giacinto committed 1y ago

a8bfb6f

June 23, 2024

chore: fix go.mod module (#2635)

Sertaç Özercan committed 1y ago

5866fc8

May 13, 2024

feat(llama.cpp): add `flash_attention` and `no_kv_offloading` (#2310)

Ettore Di Giacinto committed 2y ago

e49ea01

April 26, 2024

fix: security scanner warning noise: error handlers part 1 (#2141)

Dave committed 2y ago

2cd4936

April 25, 2024

fix: reduce chmod permissions for created files and directories (#2137)

Dave committed 2y ago

c8dd8e5

April 20, 2024

Add tensor_parallel_size setting to vllm setting items (#2085)

Taikono-Himazin committed 2y ago

03adc1f

April 17, 2024

Revert #1963 (#2056)

Ettore Di Giacinto committed 2y ago

af9e5a2

April 13, 2024

refactor: backend/service split, channel-based llm flow (#1963)

Dave committed 2y ago

eed5706

April 6, 2024

fix(llama.cpp): set better defaults for llama.cpp (#1961)

Ettore Di Giacinto committed 2y ago

8342553

April 3, 2024

fix(seed): generate random seed per-request if -1 is set (#1952)

Ettore Di Giacinto committed 2y ago

ff77d3b

March 13, 2024

fix(config): set better defaults for inferencing (#1822)

Ettore Di Giacinto committed 2y ago

f895d06

March 7, 2024

feat(intel): add diffusers/transformers support (#1746)

Ettore Di Giacinto committed 2y ago

5d10184

March 1, 2024

Bump vLLM version + more options when loading models in vLLM (#1782)

Ludovic Leroux committed 2y ago

9394113

refactor: move remaining api packages to core (#1731)

Dave committed 2y ago

1c31268