Making large AI models cheaper, faster and more accessible
[gemini] accelerate inference (#3641)
* [gemini] support don't scatter after inference * [chat] update colossalai strategy * [chat] fix opt benchmark * [chat] update opt benchmark * [gemini] optimize inference * [test] add gemini inference test * [chat] fix unit test ci * [chat] fix ci * [chat] fix ci * [chat] skip checkpoint test
H
Hongxin Liu committed
50793b35f49379ecc7b3f8a1a4f858522a561133
Parent: 4b3240c
Committed by GitHub <noreply@github.com>
on 4/26/2023, 8:32:40 AM