Commit Graph

  • 3804497186 chore: ⬆️ Update leejet/stable-diffusion.cpp to 44cca3d626d301e2215d5e243277e8f0e65bfa78 (#9428) LocalAI [bot] 2026-04-19 23:39:07 +02:00
  • fda1c553a1 fix(distributed): stop queue loops on agent nodes + dead-letter cap (#9433) Ettore Di Giacinto 2026-04-19 23:38:43 +02:00
  • b27de08fff chore(gallery): fixup wan [docs/wan-gallery-comments] Ettore Di Giacinto 2026-04-19 21:31:22 +00:00
  • 44e7d9806b fix(distributed): stop queue loops on agent nodes + dead-letter cap [distributed-ux-polish] Ettore Di Giacinto 2026-04-19 21:27:05 +00:00
  • 510f791ccc feat(gallery): add stablediffusion-ggml-development meta backend Ettore Di Giacinto 2026-04-19 20:15:37 +00:00
  • 369c50a41c fix(turboquant): drop ignore-eos patch, bump fork to b8967-627ebbc (#9423) Ettore Di Giacinto 2026-04-19 21:05:21 +02:00
  • fbc93b0a34 fix(llama-cpp): default rms_norm_eps for Gemma 3 GGUFs missing the key [issue-9414-gemma3-attention] Ettore Di Giacinto 2026-04-19 16:15:26 +00:00
  • 75a63f87d8 feat(distributed): sync state with frontends, better backend management reporting (#9426) Ettore Di Giacinto 2026-04-19 17:55:53 +02:00
  • 9cd8d7951f fix(kokoros): implement audio_transcription_stream trait stub (#9422) Ettore Di Giacinto 2026-04-19 13:29:58 +02:00
  • 7a9d89fa54 feat(ui): shared FilterBar across the System page tabs Ettore Di Giacinto 2026-04-19 08:46:22 +00:00
  • ee34a52c5d feat(ui): NodeDistributionChip — shared per-node attribution component Ettore Di Giacinto 2026-04-19 08:39:59 +00:00
  • 92b9e22dc9 feat(ui): show cluster distribution of models in the System page Ettore Di Giacinto 2026-04-19 08:37:45 +00:00
  • f0ab68e352 feat(distributed): durable backend fan-out + state reconciliation Ettore Di Giacinto 2026-04-19 08:34:57 +00:00
  • 9373de9f9b feat(ui): polish the Nodes page so it reads like a product Ettore Di Giacinto 2026-04-19 08:20:52 +00:00
  • 1b3c951c85 feat(ui): surface backend upgrades in the System page Ettore Di Giacinto 2026-04-19 08:14:49 +00:00
  • 1f43762655 fix(distributed): detect backend upgrades across worker nodes Ettore Di Giacinto 2026-04-19 08:03:20 +00:00
  • 884bfb84c9 chore: ⬆️ Update ikawrakow/ik_llama.cpp to 8befd92ea5f702494ea9813fe42a52fb015db5fe (#9418) LocalAI [bot] 2026-04-19 09:27:11 +02:00
  • e94a9a8f10 chore: ⬆️ Update leejet/stable-diffusion.cpp to 7d33d4b2ddeafa672761a5880ec33bdff452504d (#9417) LocalAI [bot] 2026-04-19 09:26:58 +02:00
  • 054c4b4b45 feat(stable-diffusion.ggml): add support for video generation (#9420) Ettore Di Giacinto 2026-04-19 09:26:33 +02:00
  • 6e49dba27c chore: ⬆️ Update ggml-org/llama.cpp to 4f02d4733934179386cbc15b3454be26237940bb (#9415) LocalAI [bot] 2026-04-19 09:26:05 +02:00
  • e463820566 fix(ui): fix dark-theme colors in chat Ettore Di Giacinto 2026-04-18 23:01:01 +00:00
  • 8839a71c87 fix(rocm): add gfx1151 support and expose AMDGPU_TARGETS build-arg (#9410) Keith Mattix II 2026-04-18 13:39:40 -05:00
  • 117f6430b8 fix(turboquant): resolve common.h by detecting llama-common vs common target (#9413) Ettore Di Giacinto 2026-04-18 20:30:28 +02:00
  • 7809c5f5d0 fix(vision): propagate mtmd media marker from backend via ModelMetadata (#9412) Ettore Di Giacinto 2026-04-18 20:30:13 +02:00
  • ad742738cb chore: ⬆️ Update ikawrakow/ik_llama.cpp to 52efa12fdae390d1dca6ecd7ca00010fe51f651e (#9404) LocalAI [bot] 2026-04-18 09:21:32 +02:00
  • 86c673fd94 chore: ⬆️ Update ggml-org/whisper.cpp to 166c20b473d5f4d04052e699f992f625ea2a2fdd (#9403) LocalAI [bot] 2026-04-18 00:42:32 +02:00
  • c49feb546f fix(llama-cpp): rename linked target common -> llama-common (#9408) Ettore Di Giacinto 2026-04-18 00:42:05 +02:00
  • 844b0b760b chore(model gallery): 🤖 add 1 new models via gallery agent (#9400) LocalAI [bot] 2026-04-17 17:56:41 +02:00
  • 55c05211d3 chore(model gallery): 🤖 add 1 new models via gallery agent (#9399) LocalAI [bot] 2026-04-17 16:10:02 +02:00
  • a90a8cf1d0 fix(ci): switch gallery-agent to sigs.k8s.io/yaml (#9397) Ettore Di Giacinto 2026-04-17 10:10:42 +02:00
  • 12b069f9bd chore(deps): bump dompurify from 3.3.2 to 3.4.0 in /core/http/react-ui in the npm_and_yarn group across 1 directory (#9376) dependabot[bot] 2026-04-17 09:06:32 +02:00
  • 48e87db400 chore: bump inference defaults from unsloth (#9396) github-actions[bot] 2026-04-17 09:05:55 +02:00
  • 7dbd9c056a chore: ⬆️ Update ggml-org/llama.cpp to 4fbdabdc61c04d1262b581e1b8c0c3b119f688ff (#9381) LocalAI [bot] 2026-04-17 08:13:04 +02:00
  • 7c5d6162f7 fix(ui): rename model config files on save to prevent duplicates (#9388) Ettore Di Giacinto 2026-04-17 08:12:48 +02:00
  • 5837b14888 chore: ⬆️ Update TheTom/llama-cpp-turboquant to `45f8a066ed5f5bb38c695cec532f6cef9f4efa9d' (#9385) Ettore Di Giacinto 2026-04-17 08:12:21 +02:00
  • b6a68e5df4 chore: ⬆️ Update leejet/stable-diffusion.cpp to a564fdf642780d1df123f1c413b19961375b8346 (#9383) LocalAI [bot] 2026-04-17 08:11:55 +02:00
  • c6dfb4acaf chore: ⬆️ Update ikawrakow/ik_llama.cpp to eaf83865a132f66e8f49efe0e78491625942f068 (#9382) LocalAI [bot] 2026-04-17 08:11:41 +02:00
  • ec5935421c chore(model-gallery): ⬆️ update checksum (#9384) LocalAI [bot] 2026-04-16 22:41:52 +02:00
  • a0cbc46be9 refactor(tinygrad): reuse tinygrad.apps.llm instead of vendored Transformer (#9380) Ettore Di Giacinto 2026-04-16 22:41:18 +02:00
  • b4e30692a2 feat(backends): add sglang (#9359) Ettore Di Giacinto 2026-04-16 22:40:56 +02:00
  • 61d34ccb11 fix(ui): show also concrete backends in the backend list Ettore Di Giacinto 2026-04-16 17:44:25 +00:00
  • 7f88a3ba30 chore: ⬆️ Update leejet/stable-diffusion.cpp to c41c5ded7af85e01b7fe442ff7950c720706d53a (#9366) LocalAI [bot] 2026-04-16 09:04:33 +02:00
  • c4f309388e fix(gallery): correct gemma-4 model URIs returning 404 (#9379) Matt Van Horn 2026-04-16 02:51:20 -04:00
  • ab326a9c61 chore(deps): bump the npm_and_yarn group across 1 directory with 6 updates (#9373) dependabot[bot] 2026-04-16 08:23:03 +02:00
  • df2d25cee5 chore: ⬆️ Update ikawrakow/ik_llama.cpp to 1163af96cf6bb4a4b819f998f84c153a49768b99 (#9368) LocalAI [bot] 2026-04-16 01:13:08 +02:00
  • 96cd561d9d chore: ⬆️ Update ggml-org/llama.cpp to b3d758750a268bf93f084ccfa3060fb9a203192a (#9370) LocalAI [bot] 2026-04-16 01:12:39 +02:00
  • 08445b1b89 chore(model-gallery): ⬆️ update checksum (#9369) LocalAI [bot] 2026-04-16 01:12:01 +02:00
  • ad3c8c4832 fix(agents): handle embedding model dim changes on collection upload (#9365) Ettore Di Giacinto 2026-04-15 20:05:28 +02:00
  • 6f0051301b feat(backend): add tinygrad multimodal backend (experimental) (#9364) Ettore Di Giacinto 2026-04-15 19:48:23 +02:00
  • 8487058673 chore(model-gallery): ⬆️ update checksum (#9358) LocalAI [bot] 2026-04-15 01:25:59 +02:00
  • 62862ca06b chore: ⬆️ Update ggml-org/llama.cpp to fae3a28070fe4026f87bd6a544aba1b2d1896566 (#9357) LocalAI [bot] 2026-04-15 01:25:41 +02:00
  • 07e244d869 feat(swagger): update swagger (#9356) LocalAI [bot] 2026-04-15 01:25:24 +02:00
  • 95efb8a562 feat(backend): add turboquant llama.cpp-fork backend (#9355) Ettore Di Giacinto 2026-04-15 01:25:04 +02:00
  • 410d100cc3 chore(ui): improve visibility of forms, color palette Ettore Di Giacinto 2026-04-14 21:53:03 +00:00
  • 833b7e8557 chore(docs): update transcription endpoint Ettore Di Giacinto 2026-04-14 14:14:46 +00:00
  • 87e6de1989 feat: wire transcription for llama.cpp, add streaming support (#9353) Ettore Di Giacinto 2026-04-14 16:13:40 +02:00
  • b361d2ddd6 chore(gallery): add new llama.cpp supported models (qwen-asr, ocr) Ettore Di Giacinto 2026-04-14 10:04:50 +00:00
  • 1e4c4577bb fix(ci): small fixups Ettore Di Giacinto 2026-04-14 09:27:27 +00:00
  • 98fd9d5cc6 chore(deps): bump github.com/charmbracelet/glamour from 0.10.0 to 1.0.0 (#9340) dependabot[bot] 2026-04-14 11:17:05 +02:00
  • 0c725f5702 chore(deps): bump github.com/swaggo/echo-swagger from 1.4.1 to 1.5.2 (#9344) dependabot[bot] 2026-04-14 11:15:37 +02:00
  • 7661a4ffa5 chore(deps): bump github.com/testcontainers/testcontainers-go/modules/nats from 0.41.0 to 0.42.0 (#9341) dependabot[bot] 2026-04-14 11:15:26 +02:00
  • 24ad6e4be1 chore(deps): bump github.com/google/go-containerregistry from 0.21.3 to 0.21.5 (#9343) dependabot[bot] 2026-04-14 11:15:09 +02:00
  • c0648b8836 chore: ⬆️ Update ikawrakow/ik_llama.cpp to 55d3c05bf7b377deaa5dc84d255d9740a345a206 (#9348) LocalAI [bot] 2026-04-14 08:56:25 +02:00
  • a05c7def59 fix(e2e): update to new testcontainers Ettore Di Giacinto 2026-04-14 06:56:04 +00:00
  • 906acba8db chore: ⬆️ Update ggml-org/llama.cpp to e97492369888f5311e4d1f3beb325a36bbed70e9 (#9347) LocalAI [bot] 2026-04-14 08:54:25 +02:00
  • 4226ca4aee chore(deps): bump sentence-transformers from 5.2.3 to 5.4.0 in /backend/python/transformers (#9342) dependabot[bot] 2026-04-14 00:30:27 +02:00
  • c6d5dc3374 chore(model-gallery): ⬆️ update checksum (#9346) LocalAI [bot] 2026-04-13 23:00:13 +02:00
  • 7ce675af21 chore(gallery-agent): extract readme Ettore Di Giacinto 2026-04-13 20:31:49 +00:00
  • be1b8d56c9 fix(gallery): override parameters for flux kontext Ettore Di Giacinto 2026-04-13 22:29:17 +02:00
  • 97f087ed31 chore(deps): bump github.com/testcontainers/testcontainers-go from 0.41.0 to 0.42.0 (#9338) dependabot[bot] 2026-04-13 21:54:02 +02:00
  • 8691bbe663 chore(deps): bump actions/upload-pages-artifact from 4 to 5 (#9337) dependabot[bot] 2026-04-13 21:53:47 +02:00
  • 7998f96f11 chore(deps): bump softprops/action-gh-release from 2 to 3 (#9336) dependabot[bot] 2026-04-13 21:53:28 +02:00
  • cada97ee46 chore(gallery-agent): control bot via PR Ettore Di Giacinto 2026-04-13 19:52:48 +00:00
  • 3375ea1a2c chore(gallery-agent): simplify Ettore Di Giacinto 2026-04-13 19:50:31 +00:00
  • 0e7c0adee4 docs: document tool calling on vLLM and MLX backends Ettore Di Giacinto 2026-04-13 16:58:55 +00:00
  • 016da02845 feat: refactor shared helpers and enhance MLX backend functionality (#9335) Ettore Di Giacinto 2026-04-13 18:44:03 +02:00
  • daa0272f2e docs(agents): capture vllm backend lessons + runtime lib packaging (#9333) Ettore Di Giacinto 2026-04-13 11:09:57 +02:00
  • d67623230f feat(vllm): parity with llama.cpp backend (#9328) Ettore Di Giacinto 2026-04-13 11:00:29 +02:00
  • cd56a05c3e ci(vllm): disable tests-vllm-grpc job (heterogeneous runners) [feat/vllm-parity] Ettore Di Giacinto 2026-04-13 07:46:57 +00:00
  • 0f90d17aac feat(swagger): update swagger (#9329) LocalAI [bot] 2026-04-13 09:42:36 +02:00
  • ea32b8953f chore: ⬆️ Update ggml-org/llama.cpp to 1e9d771e2c2f1113a5ebdd0dc15bafe57dce64be (#9330) LocalAI [bot] 2026-04-13 09:42:18 +02:00
  • d74cd56b14 feat(vllm): bundle libnuma/libgomp via package.sh Ettore Di Giacinto 2026-04-12 20:20:21 +00:00
  • 017bdee4e4 ci(vllm): install libnuma1 + libgomp1 on bigger-runner Ettore Di Giacinto 2026-04-12 20:18:13 +00:00
  • c4dc495ea1 ci(vllm): install make + build deps on bigger-runner Ettore Di Giacinto 2026-04-12 20:08:09 +00:00
  • ea2bbabffd ci(vllm): use bigger-runner instead of source build Ettore Di Giacinto 2026-04-12 16:02:49 +00:00
  • bc7578bdb1 fix(hipblas): pin down rocm6.4 wheels on whisperx (7.x not supported) Ettore Di Giacinto 2026-04-12 15:27:51 +00:00
  • 329df11989 fix(vllm): build from source on CI to avoid SIGILL on prebuilt wheel Ettore Di Giacinto 2026-04-12 15:14:42 +00:00
  • c7f444d18b ci(test-extra): run vllm e2e tests on CPU Ettore Di Giacinto 2026-04-12 14:53:44 +00:00
  • e7f406169a test(e2e-backends): add tools capability + HF model name support Ettore Di Giacinto 2026-04-12 14:51:58 +00:00
  • 034a60bf76 ci(backend): build cpu-vllm container image Ettore Di Giacinto 2026-04-12 09:43:04 +00:00
  • c99188f106 fix(vllm): tool parser constructor compat + e2e tool calling test Ettore Di Giacinto 2026-04-12 09:15:16 +00:00
  • c2f73a987e fix(vllm): CPU build compatibility with vllm 0.14.1 Ettore Di Giacinto 2026-04-12 08:58:57 +00:00
  • b215843807 feat(vllm): CPU support + shared utils + vllm-omni feature parity Ettore Di Giacinto 2026-04-12 08:19:32 +00:00
  • 6786f05c64 feat(vllm): wire native tool/reasoning parsers + chat deltas + logprobs Ettore Di Giacinto 2026-04-12 08:19:14 +00:00
  • 6cf8263c30 feat(config): add vLLM parser defaults hook and importer auto-detection Ettore Di Giacinto 2026-04-12 08:11:46 +00:00
  • a30719f04a refactor(config): introduce backend hook system and migrate llama-cpp defaults Ettore Di Giacinto 2026-04-12 08:11:38 +00:00
  • 40b1c6f943 fix(schema): serialize ToolCallID and Reasoning in Messages.ToProto Ettore Di Giacinto 2026-04-12 08:11:24 +00:00
  • 9ca03cf9cc feat(backends): add ik-llama-cpp (#9326) Ettore Di Giacinto 2026-04-12 13:51:28 +02:00
  • 151ad271f2 feat(rocm): bump to 7.x (#9323) Ettore Di Giacinto 2026-04-12 08:51:30 +02:00
  • 2865f0f8d3 feat(ux): backend management enhancement (#9325) Ettore Di Giacinto 2026-04-12 00:35:22 +02:00