fix(autogptq): do not use_triton with qwen-vl (#1985)
* Enhance autogptq backend to support VL models
* update dependencies for autogptq
* remove redundant auto-gptq dependency
* Convert base64 to image_url for Qwen-VL model
* implemented model inference for qwen-vl
* remove user prompt from generated answer
* fixed write image error
* fixed use_triton issue when loading Qwen-VL model

---------

Co-authored-by: Binghua Wu <bingwu@estee.com>
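As a rough illustration of three of the changes listed above (skipping `use_triton` for Qwen-VL, writing a base64 image to disk, and removing the user prompt from the generated answer), here is a minimal Python sketch. The helper names `resolve_use_triton`, `save_base64_image`, and `strip_prompt` are hypothetical and are not taken from the actual diff. The substring check on the model name is likewise an assumption about how a Qwen-VL model might be detected.

```python
import base64
import os
import tempfile


def resolve_use_triton(model_name: str, requested: bool) -> bool:
    """Hypothetical helper: never enable Triton for Qwen-VL models."""
    # Assumption: a Qwen-VL model can be recognized from its name.
    if "qwen-vl" in model_name.lower():
        return False
    return requested


def save_base64_image(data: str, directory: str) -> str:
    """Hypothetical helper: decode a base64 payload and write it to a file.

    Returns the path of the written file so it can be passed on as an
    image reference.
    """
    fd, path = tempfile.mkstemp(suffix=".jpg", dir=directory)
    # Open in binary mode: the decoded payload is bytes, not text.
    with os.fdopen(fd, "wb") as fh:
        fh.write(base64.b64decode(data))
    return path


def strip_prompt(prompt: str, generated: str) -> str:
    """Hypothetical helper: drop the echoed prompt from the model output."""
    if generated.startswith(prompt):
        return generated[len(prompt):].lstrip()
    return generated
```

Opening the file in binary mode matters here, since writing decoded bytes through a text-mode handle raises a `TypeError` in Python 3.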
Sebastian.W committed
0004ec8be3ca150ce6d8b79f2991bfe3a9dc65ad
Parent: d692b2c
Committed by Ettore Di Giacinto <mudler@localai.io>
on 4/11/2024, 10:33:58 AM