Making large AI models cheaper, faster and more accessible
COMMITS
/ examples/language/llama/benchmark.py December 17, 2024
F
[Device]Support npu (#6159)
flybird11111 committed
November 19, 2024
D
[Zerobubble] merge main. (#6142)
duanjunwen committed
September 10, 2024
B
August 22, 2024
W
[fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016)
Wang Binluo committed
August 16, 2024
F
[fp8] zero support fp8 linear. (#6006)
flybird11111 committed
August 9, 2024
H
[fp8] support gemini plugin (#5978)
Hongxin Liu committed
August 8, 2024
H
August 6, 2024
F
[FP8] rebase main (#5963)
flybird11111 committed
June 26, 2024
B
[gemini] fixes for benchmarking (#5847)
botbw committed
E
[Feature] optimize PP overlap (#5735)
Edenzzzz committed
June 17, 2024
E
Support 4d parallel + flash attention (#5789)
Edenzzzz committed
May 27, 2024
G
correct argument help message
genghaozhe committed
May 25, 2024
G
add args.prefetch_num for benchmark
genghaozhe committed
May 24, 2024
H
Merge branch 'main' of github.com:hpcaitech/ColossalAI into prefetch
hxwang committed
H
[example] add profile util for llama
hxwang committed
B
April 29, 2024
H
[misc] refactor launch API and tensor constructor (#5666)
Hongxin Liu committed
April 26, 2024
T
[hotfix] add soft link to support required files (#5661)
Tong Li committed
April 25, 2024
H
[shardformer] refactor pipeline grad ckpt config (#5646)
Hongxin Liu committed
April 23, 2024
B
[example] llama3 (#5631)
binmakeswell committed