Making large AI models cheaper, faster and more accessible
[ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897)
Guangyao Zhang committed
669849d74b3ca6b2a07cd522bc6f56d70e81669c
Parent: fbf33ec
Committed by GitHub <noreply@github.com>
on 7/10/2024, 3:34:25 AM