Making large AI models cheaper, faster and more accessible
[Fix/Inference]Fix vllm benchmark (#5630)
* Fix bugs about OOM when running vllm-0.4.0
* rm used params
* change generation_config
* change benchmark log file name
yuehuayingxueluo committed
90cd5227a348dfe506e95b2e49f2a8dcd34fdbca
Parent: 279300d
Committed by GitHub <noreply@github.com>
on 4/24/2024, 6:51:36 AM