fix(llama.cpp): enable cont batching when parallel is set (#1622)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Ettore Di Giacinto committed
697c769b6422b7084f7c815c5a84bcff50f240f3
Parent: 94261b1
Committed by GitHub <noreply@github.com>
on 1/21/2024, 1:59:48 PM