A high-throughput and memory-efficient inference and serving engine for LLMs