llama: end-to-end tests (#19802)
* tests: add end-to-end tests per model architecture * fixup for rebase * fix use-after-free in llama-model-loader.cpp * fix CI * fix WebGPU * fix CI * disable CI for macOS-latest-cmake-arm64 * use expert_weights_scale only if != 0.0f * comments
J
Johannes Gäßler committed
a976ff081b4657b67f48295bbefc030d9d899b17
Parent: a950479
Committed by GitHub <[email protected]>
on 3/8/2026, 11:30:21 AM