feat(llama.cpp): expose cache_type_k and cache_type_v for quant of kv cache (#4329)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
E
Ettore Di Giacinto committed
d4c1746c7db3d13ba97bb9d8a8b698d8a366a0a7
Parent: 88737e1
Committed by GitHub <noreply@github.com>
on 12/6/2024, 9:23:59 AM