COMMITS
/ pkg/grpc/client.go May 25, 2026
R
feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802)
Richard Palethorpe committed
May 13, 2026
R
feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801)
Richard Palethorpe committed
May 5, 2026
E
feat(api): add /v1/audio/diarization endpoint with sherpa-onnx + vibevoice.cpp (#9654)
Ettore Di Giacinto committed
May 4, 2026
R
feat: add LocalVQE backend and audio transformations UI (#9640)
Richard Palethorpe committed
April 23, 2026
E
feat: voice recognition (#9500)
Ettore Di Giacinto committed
April 22, 2026
E
April 14, 2026
E
feat: wire transcription for llama.cpp, add streaming support (#9353)
Ettore Di Giacinto committed
March 29, 2026
E
feat: add distributed mode (#9124)
Ettore Di Giacinto committed
March 21, 2026
E
feat(quantization): add quantization backend (#9096)
Ettore Di Giacinto committed
E
feat: add (experimental) fine-tuning support with TRL (#9088)
Ettore Di Giacinto committed
March 13, 2026
R
feat(realtime): WebRTC support (#8790)
Richard Palethorpe committed
January 30, 2026
E
feat(tts): add support for streaming mode (#8291)
Ettore Di Giacinto committed
January 22, 2026
E
feat: detect thinking support from backend automatically if not explicitly set (#8167)
Ettore Di Giacinto committed
November 9, 2025
E
feat: respect context and add request cancellation (#7187)
Ettore Di Giacinto committed
July 27, 2025
E
feat(rfdetr): add object detection API (#5923)
Ettore Di Giacinto committed
April 26, 2025
E
feat(video-gen): add endpoint for video generation (#5247)
Ettore Di Giacinto committed
April 19, 2025
E
chore: bump grpc limits to 50MB (#5212)
Ettore Di Giacinto committed
December 18, 2024
M
feat: stream tokens usage (#4415)
mintyleaf committed
December 8, 2024
E
Revert "feat: include tokens usage for streamed output" (#4336)
Ettore Di Giacinto committed
November 28, 2024
M
feat: include tokens usage for streamed output (#4282)
mintyleaf committed
November 20, 2024
E
feat(silero): add Silero-vad backend (#4204)
Ettore Di Giacinto committed
October 1, 2024
S
feat: Add Get Token Metrics to GRPC server (#3687)
siddimore committed
September 2, 2024
D
August 25, 2024
E
fix(model-loading): keep track of open GRPC Clients (#3377)
Ettore Di Giacinto committed
August 24, 2024
D
feat: elevenlabs `sound-generation` api (#3355)
Dave committed
June 23, 2024
S
chore: fix go.mod module (#2635)
Sertaç Özercan committed
April 29, 2024
D
April 24, 2024
E
feat(rerankers): Add new backend, support jina rerankers API (#2121)
Ettore Di Giacinto committed
April 17, 2024
E
Revert #1963 (#2056)
Ettore Di Giacinto committed
April 13, 2024
D
March 22, 2024
R
feat(stores): Vector store backend (#1795)
Richard Palethorpe committed
February 21, 2024
D
January 23, 2024
C
feat(grpc): backend SPI pluggable in embedding mode (#1621)
coyzeng committed
January 7, 2024
E
feat: more embedded models, coqui fixes, add model usage and description (#1556)
Ettore Di Giacinto committed
January 5, 2024
E
Revert "[Refactor]: Core/API Split" (#1550)
Ettore Di Giacinto committed
D
[Refactor]: Core/API Split (#1506)
Dave committed
November 26, 2023
E
feat: initial watchdog implementation (#1341)
Ettore Di Giacinto committed
November 16, 2023
E
feat: queue up requests if not running parallel requests (#1296)
Ettore Di Giacinto committed
August 20, 2023
E
fix: drop racy code, refactor and group API schema (#931)
Ettore Di Giacinto committed
August 18, 2023
E
feat: add --single-active-backend to allow only one backend active at the time (#925)
Ettore Di Giacinto committed
D
Usage Features (#863)
Dave committed
July 27, 2023
E
fix: use bytes in gRPC proto instead of strings (#813)
Ettore Di Giacinto committed
July 14, 2023
E
feat: move other backends to grpc
Ettore Di Giacinto committed
E
feat: move llama to a grpc
Ettore Di Giacinto committed
E
feat: add falcon ggllm via grpc client
Ettore Di Giacinto committed