fix: ExLlama Backend Context Size & Rope Scaling (#1311)
* fix: context_size not propagated to exllama backend
* fix: exllama rope scaling
ok2sh committed
20d637e7b70cf0e15e6bf255ab2e4c080ddde2b0
Parent: 480b14c
Committed by GitHub <noreply@github.com>
on 11/21/2023, 6:26:39 PM