ggml-zendnn : adaptive fallback to CPU backend for small batch sizes (#22681)
* ggml-zendnn : add runtime env var GGML_ZENDNN_ADAPTIVE_FALLBACK to control adaptive fallback (default: enabled) * ggml-zendnn : restore original fallback logic when adaptive fallback is disabled
S
Sachin Sharma committed
61af07c22df7e06d07905a74f39d3809e6c0522f
Parent: 856c3ad
Committed by GitHub <noreply@github.com>
on 5/13/2026, 6:13:47 AM