COMMITS
March 12, 2026
A
feat: add support for the Qwen3.5 family of models (#1624)
AlpinDale committed
February 17, 2026
January 21, 2026
A
fix: mark GLM-4 MoE Lite as an MLA model (#1621)
AlpinDale committed
January 19, 2026
A
[models] add support for GLM-4.7 Flash (#1620)
AlpinDale committed
January 7, 2026
A
fix: tokenizer server init (#1617)
AlpinDale committed
November 22, 2025
A
[cli] add CLI arg for selecting attention backend (#1612)
AlpinDale committed
November 19, 2025
A
[fix] log message killing compilation
AlpinDale committed
November 16, 2025
A
[logger][metrics] log number of cache hits in the request-level logger (#1611)
AlpinDale committed
A
[cli][diffusion] only import diffusion backend when it is called (#1610)
AlpinDale committed
November 10, 2025
A
[diffusion] `aphrodite diffusion` backend (#1607)
AlpinDale committed
A
[engine] add API for concurrency rate and kv cache token limit (#1608)
AlpinDale committed
November 8, 2025
A
[readme] update installation guide in readme
AlpinDale committed
A
[build] fix aphrodite-kernels wheel installation for pypi compat (#1606)
AlpinDale committed
A
[build][kernels] isolated aphrodite kernel library (#1602)
AlpinDale committed
November 6, 2025
A
[multi node] better cluster example script (#1605)
AlpinDale committed
November 5, 2025
A
[build] upgrade flashinfer to 0.5.1 (#1601)
AlpinDale committed
A
[quant] fix GLM-4.5V AWQ (#1600)
AlpinDale committed
A
[lora][moe] fix MoE models by registering the correct op (#1599)
AlpinDale committed
November 4, 2025
A
[build] downgrade flashinfer to 0.4.1 (#1598)
AlpinDale committed
A
[sync] sync to upstream 03c4c4a (#1597)
AlpinDale committed
A
[offloader] fix async scheduling support with KV cache offloader (#1596)
AlpinDale committed
A
[api] bring back anthropic /v1/messages endpoint in OpenAI server (#1595)
AlpinDale committed
A
[sampler] fix mixed penalties in batch with async scheduling (#1594)
AlpinDale committed
A
[TPU] prevent single-process DP (#1593)
AlpinDale committed
A
[python3.10] import `Self` from `typing_extensions` (#1592)
AlpinDale committed
A
[spec] fix DeepSeek v3.2 MTP metadata and cuda graph (#1591)
AlpinDale committed
A
[v0] remove `APHRODITE_USE_V1` from platform and v1 (#1590)
AlpinDale committed
A
[kvoffload] feat: make LMCache connecter work (#1589)
AlpinDale committed
A
[fix] engine args import
AlpinDale committed