A high-throughput and memory-efficient inference and serving engine for LLMs