[Performance] Optimize AdamW GPU kernel and Change the learning rate type to float64 (#78830)
* chore: apply pre-commit format fixes
* fix optimizer: add get_lr_dtype()
* fix rtol for opTest
* fix
* fix xpu test and fused_adam_kernel
* fix
* fix win
* fix XPU kernel
* fix
* fix test
* fix sharding.py
* revert beta_pow type
* fix test
* fix test2
* fix test3

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
zhengshengning committed
3d2583434816172863243cea777b243b208a49bb
Parent: 1c1f092
Committed by GitHub <noreply@github.com>
on 5/11/2026, 11:01:14 AM