Commits: examples/inference/benchmark_ops/benchmark_kv_cache_memcopy.py - hpcaitech/ColossalAI - Morph

SIGN IN SIGN UP

hpcaitech / ColossalAI UNCLAIMED

Making large AI models cheaper, faster and more accessible

0 0 0 Python

COMMITS

/ examples/inference/benchmark_ops/benchmark_kv_cache_memcopy.py

pre-commit-ci-update-config

May 5, 2024

Y

[Fix] Fix & Update Inference Tests (compatibility w/ main)

Yuanheng Zhao committed 2y ago

May 3, 2024

Y

[kernel] Support New KCache Layout - Triton Kernel (#5677)

Yuanheng Zhao committed 2y ago

April 30, 2024

S

[Inference/Kernel] refactor kvcache manager and rotary_embedding and kvcache_memcpy oper… (#5663)

Steve Luo committed 2y ago

February 28, 2024

Y

[Inference]Add CUDA KVCache Kernel (#5406)

yuehuayingxueluo committed 2y ago