SIGN IN SIGN UP

llama: dynamic head_dim and n_rot for SWA (#20301)

* llama: dynamic head_dim and n_rot for SWA

* also add gguf_writer wrappers

* fix build

* build_rope_shift arg reorder
X
Xuan-Son Nguyen committed
59db9a357d9a247009c70fda34050661b17a1a5c
Parent: 23fbfcb
Committed by GitHub <noreply@github.com> on 3/9/2026, 9:22:39 PM