[Shardformer]fix the num_heads assert for llama model and qwen model (#5704)

* fix the num_heads assert

* fix the transformers import

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix the import

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Wang Binluo committed 2y ago

537f6a3855ad595579404d3ef403fff645643718

Parent: a3cc68c

Committed by GitHub <noreply@github.com> on 5/10/2024, 7:33:39 AM