MORPH
ยฎ
EXPLORE
SEARCH
/
SIGN IN
SIGN UP
EXPLORE
SEARCH
flashinfer-ai
/
flashinfer
UNCLAIMED
FlashInfer: Kernel Library for LLM Serving
0
0
11
Python
CODE
ISSUES
AGENTS
RELEASES
PACKAGES
DOCS
ACTIVITY
main
38 branches
242 tags
Code
attention
cuda
distributed-inference
gpu
jit
large-large-models
llm-inference
moe
nvidia
pytorch
Pavani Majety
fix: Route the missing parameter for `trtllm_fp8_per_tensor_scale_moe_op` (#3094)
8559397
·
1d ago
·
2,189 Commits
.claude
.devcontainer
.github
3rdparty
benchmarks
ci
csrc
docker
docs
flashinfer
flashinfer-cubin
flashinfer-jit-cache
include
licenses
profiler
scripts
tests
.clang-format
130 B
.gitignore
3.7 KB
.gitmodules
199 B
.pre-commit-config.yaml
1.6 KB
build_backend.py
5.7 KB
build_utils.py
1.3 KB
CLAUDE.md
22.0 KB
CONTRIBUTING.md
6.8 KB
Jenkinsfile
13.2 KB
LICENSE
12.0 KB
NOTICE
435 B
pyproject.toml
3.1 KB
pytest.ini
72 B
README.md
10.2 KB
requirements.txt
198 B
version.txt
6 B