COMMITS
June 16, 2026
H
fix: fix error injection's lint errors (#2363)
Haiyang Shi committed
June 15, 2026
J
Fix smart client regression (#2361)
Jiaxin Shan committed
J
feat(batch): Add batch smart client transport retry foundation (#2339)
Jiaxin Shan committed
H
feat(console): error injection framework (#2336)
Haiyang Shi committed
J
feat(batch): Enabling job informer + job list pagination (#2322)
Jingyuan committed
V
[docs] Add TRT-LLM support to multi-engine page (#2357)
Varun Gupta committed
R
V
X
[Docs] Add Brixbench usage documentation (#2352)
xiaoyu-xyz committed
June 14, 2026
V
fix(race-test): slo queue router (#2353)
Varun Gupta committed
J
Bump version to v0.7.0-rc.2 (#2350)
Jiaxin Shan committed
J
chore: upgrade CI action versions (#2349)
Jiaxin Shan committed
J
[Docs] Add console production setup docs (#2348)
Jiaxin Shan committed
J
[Docs] Add local-mode doc and update stable install to v0.6.0 (#2347)
Jiaxin Shan committed
June 13, 2026
J
[Docs] Update batch inference docs (#2337)
Jiaxin Shan committed
June 12, 2026
J
Decoupling redis client and redis libs. (#2324)
Jingyuan committed
J
Restore stand alone driver mode (#2323)
Jingyuan committed
J
[API] Add support for OpenAI Responses API (#2312)
Jan Staněk committed
June 11, 2026
V
chore: nit fix in slo_test race condition test (#2328)
Varun Gupta committed
V
V
refactor(pd): extract PrefillExecutor into pd/prefill/ package (#2320)
Varun Gupta committed
H
fix(console): add extraBody field to job struct (#2321)
Haiyang Shi committed
J
H
feat(rm): add time window to resource listing options (#2325)
Haiyang Shi committed
H
chore: refine kvcache related dockerfile and docs (#2297)
Haiyang Shi committed
June 10, 2026
V
feat(pd): extract EngineHandler and PodSelector abstractions for PD routing (#2308)
Varun Gupta committed
V
J
[Misc] Add exponential backoff to provisioning failure. (#2319)
Jingyuan committed