Commits: core/http/endpoints/openai/inference.go - mudler/LocalAI

mudler / LocalAI UNCLAIMED

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

0 0 66 Go

COMMITS

/ core/http/endpoints/openai/inference.go

master

April 9, 2026

fix: thinking models with tools returning empty content (reasoning-only retry loop) (#9290)

Ettore Di Giacinto committed 1mo ago

13a6ed7

April 6, 2026

fix(chat): do not retry if we had chatdeltas or tooldeltas from backend (#9244)

Ettore Di Giacinto committed 2mo ago

773489e

March 29, 2026

feat: add distributed mode (#9124)

Ettore Di Giacinto committed 2mo ago

59108fb

March 16, 2026

chore: refactor endpoints to use same inferencing path, add automatic retrial mechanism in case of errors (#9029)

Ettore Di Giacinto committed 2mo ago

ee96e5e

March 8, 2026

feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838)

Ettore Di Giacinto committed 2mo ago

b2f81bf

March 5, 2026

feat: pass-by metadata to predict options (#8795)

Ettore Di Giacinto committed 3mo ago

580517f

November 16, 2025

feat: add support to logitbias and logprobs (#7283)

Ettore Di Giacinto committed 6mo ago

d7f9f3a

November 7, 2025

feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120)

Ettore Di Giacinto committed 7mo ago

02cc8cb

August 14, 2025

feat(backends): add system backend, refactor (#6059)

Ettore Di Giacinto committed 9mo ago

089efe0

June 29, 2025

fix(gallery): automatically install model from name (#5757)

Ettore Di Giacinto committed 11mo ago

33f9ee0

February 10, 2025

feat: Centralized Request Processing middleware (#3847)

Dave committed 1y ago

3cddf24

January 17, 2025

feat: add machine tag and inference timings (#4577)

mintyleaf committed 1y ago

96f8ec0

September 19, 2024

feat(api): allow to pass audios to backends (#3603)

Ettore Di Giacinto committed 1y ago

191bc2e

feat(api): allow to pass videos to backends (#3601)

Ettore Di Giacinto committed 1y ago

fbb9fac

June 23, 2024

chore: fix go.mod module (#2635)

Sertaç Özercan committed 1y ago

5866fc8

April 17, 2024

Revert #1963 (#2056)

Ettore Di Giacinto committed 2y ago

af9e5a2

April 13, 2024

refactor: backend/service split, channel-based llm flow (#1963)

Dave committed 2y ago

eed5706

April 11, 2024

feat: use tokenizer.apply_chat_template() in vLLM (#1990)

Ludovic Leroux committed 2y ago

12c0d94

March 1, 2024

refactor: move remaining api packages to core (#1731)

Dave committed 2y ago

1c31268