Making large AI models cheaper, faster and more accessible
[Shardformer]fix the num_heads assert for llama model and qwen model (#5704)
* fix the num_heads assert * fix the transformers import * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the import --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
W
Wang Binluo committed
537f6a3855ad595579404d3ef403fff645643718
Parent: a3cc68c
Committed by GitHub <noreply@github.com>
on 5/10/2024, 7:33:39 AM