Commits - dhruvhead/aphrodite-engine - Morph

dhruvhead / aphrodite-engine

C++

COMMITS

main

March 12, 2026

feat: add support for the Qwen3.5 family of models (#1624)

AlpinDale committed 1mo ago

February 17, 2026

fix: compute engine max_concurrency from worker KV cache configs to match runtime reporting (#1622)

lucy committed 1mo ago

January 21, 2026

fix: mark GLM-4 MoE Lite as an MLA model (#1621)

AlpinDale committed 2mo ago

January 19, 2026

[models] add support for GLM-4.7 Flash (#1620)

AlpinDale committed 2mo ago

January 7, 2026

fix: tokenizer server init (#1617)

AlpinDale committed 3mo ago

November 22, 2025

[cli] add CLI arg for selecting attention backend (#1612)

AlpinDale committed 4mo ago

November 19, 2025

[fix] log message killing compilation

AlpinDale committed 4mo ago

November 16, 2025

[logger][metrics] log number of cache hits in the request-level logger (#1611)

AlpinDale committed 5mo ago

[cli][diffusion] only import diffusion backend when it is called (#1610)

AlpinDale committed 5mo ago

November 10, 2025

[diffusion] `aphrodite diffusion` backend (#1607)

AlpinDale committed 5mo ago

[engine] add API for concurrency rate and kv cache token limit (#1608)

AlpinDale committed 5mo ago

November 8, 2025

[readme] update installation guide in readme

AlpinDale committed 5mo ago

[build] fix aphrodite-kernels wheel installation for pypi compat (#1606)

AlpinDale committed 5mo ago

[build][kernels] isolated aphrodite kernel library (#1602)

AlpinDale committed 5mo ago

November 6, 2025

[multi node] better cluster example script (#1605)

AlpinDale committed 5mo ago

[structured outputs] require structured output parameters to be explicitly None or valid (#1604)

50h100a committed 5mo ago

November 5, 2025

[build] upgrade flashinfer to 0.5.1 (#1601)

AlpinDale committed 5mo ago

[quant] fix GLM-4.5V AWQ (#1600)

AlpinDale committed 5mo ago

[lora][moe] fix MoE models by registering the correct op (#1599)

AlpinDale committed 5mo ago

November 4, 2025

[build] downgrade flashinfer to 0.4.1 (#1598)

AlpinDale committed 5mo ago

[sync] sync to upstream 03c4c4a (#1597)

AlpinDale committed 5mo ago

[offloader] fix async scheduling support with KV cache offloader (#1596)

AlpinDale committed 5mo ago

[api] bring back anthropic /v1/messages endpoint in OpenAI server (#1595)

AlpinDale committed 5mo ago

[sampler] fix mixed penalties in batch with async scheduling (#1594)

AlpinDale committed 5mo ago

[TPU] prevent single-process DP (#1593)

AlpinDale committed 5mo ago

[python3.10] import `Self` from `typing_extensions` (#1592)

AlpinDale committed 5mo ago

[spec] fix DeepSeek v3.2 MTP metadata and cuda graph (#1591)

AlpinDale committed 5mo ago

[v0] remove `APHRODITE_USE_V1` from platform and v1 (#1590)

AlpinDale committed 5mo ago

[kvoffload] feat: make LMCache connecter work (#1589)

AlpinDale committed 5mo ago

[fix] engine args import

AlpinDale committed 5mo ago