Bump vLLM version + more options when loading models in vLLM (#1782)
* Bump vLLM version to 0.3.2 * Add vLLM model loading options * Remove transformers-exllama * Fix install exllama
L
Ludovic Leroux committed
939411300ab55cc84690e62442b51ab0f2c9de3b
Parent: 1c31268
Committed by GitHub <noreply@github.com>
on 3/1/2024, 9:48:53 PM