SIGN IN SIGN UP

hpcaitech / ColossalAI UNCLAIMED

Making large AI models cheaper, faster and more accessible

0 0 38 Python

[Inference/Feat] Add kvcache quant support for fused_rotary_embedding_cache_copy (#5680)

傅

傅剑寒 committed 2y ago

ef8e4ffe310bfe21f83feb965d962d816d75bc88

Parent: 5cd75ce

Committed by GitHub <noreply@github.com> on 4/30/2024, 10:33:53 AM