llama: end-to-end tests (#19802)

* tests: add end-to-end tests per model architecture

* fixup for rebase

* fix use-after-free in llama-model-loader.cpp

* fix CI

* fix WebGPU

* fix CI

* disable CI for macOS-latest-cmake-arm64

* use expert_weights_scale only if != 0.0f

* comments

Johannes Gäßler committed 24d ago

a976ff081b4657b67f48295bbefc030d9d899b17

Parent: a950479

Committed by GitHub on 3/8/2026, 11:30:21 AM