Activity - vllm-project/vllm - Morph

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

ACTIVITY

COMMITS: 20 in the last week

CONTRIBUTORS: 18 active

STARS: 0 total

FORKS: 0 total

TOP CONTRIBUTORS

omerpaz95

2 commits

z1ying

2 commits

Netanel Haber

1 commit

TJian

1 commit

Flora Feng

1 commit

Luciano Martins

1 commit

mysterious hhhh

1 commit

Dan Alistarh

1 commit

Yusuf Mohammad

1 commit

Jee Jee Li

1 commit

RECENT COMMITS

Optimize nemotron VL image/video preprocessing (#40283)

Netanel Haber 16h ago

[FEAT] [Perf] [Gemma4] Fused Gemma4 Routing Function Triton (#39083)

TJian 21h ago

[Bugfix] Kimi-K2 tool parser streaming - fix token leakage, argument truncation, and content dropping (#38579)

Flora Feng 22h ago

[KV Offload] Pass request context (#39185)

omerpaz95 1d ago

[KV Connector] Allow metrics of multiple connectors of same types in multi connector. (#40010)

omerpaz95 1d ago

[Frontend] Preserve structured output special tokens in offline LLM.chat (#39352)

Luciano Martins 1d ago

[Bugfix] Guard mxfp4_experts_quant bindings on ENABLE_NVFP4_SM100 (#40191)

mysterious hhhh 1d ago

[Attention] TurboQuant: remove redundant random signs, add prior art attribution (#40194)

Dan Alistarh 1d ago

Added general ND x ND matmul and unit test for it (#39909)

Yusuf Mohammad 1d ago

[DOC] Add fuse_minimax_qk_norm (#39782)

Jee Jee Li 1d ago

[Refactor] Drop direct dependency on librosa (#39079)

Nick Cao 2d ago

[ZenCPU] AMD Zen CPU Backend with supported dtypes via zentorch weekly (#39967)

Chinmay-Kulkarni-AMD 2d ago

[Bugfix] Fix k_proj's bias for GLM-ASR (#40160)

Rishapveer Singh 2d ago

[Doc] Fix outdated source reference comment in anthropic/serving.py (#40189)

z1ying 2d ago

[Frontend] Add multimodal support to /inference/v1/generate endpoint (#38405)

Nithin Chalapathi 2d ago

[Doc] Add Realtime Transcription section to supported_models.md (#39845)

z1ying 2d ago

[Core] Reduce mm scheduler, get_num_embed overhead (#40143)

milesial 2d ago

[XPU] fix all_reduce all-zero accuracy issue under torch.compile (#39844)

Chaojun Zhang 2d ago

[CI] Speed up test_fused_marlin_moe (#40178)

Michael Goin 2d ago

[XPU]fake impl for xpu fp8_gemm (#39984)

Xinyu Chen 2d ago