[Performance] Optimize AdamW GPU kernel and change the learning rate type to float64 (#78830)

* chore: apply pre-commit format fixes

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* fix optimizer: add get_lr_dtype()

* fix rtol for opTest

* fix

* fix xpu test and fused_adam_kernel

* fix

* fix win

* fix XPU kernel

* fix

* fix test

* fix sharding.py

* revert beta_pow type

* fix test

* fix test2

* fix test3

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>

zhengshengning committed 1mo ago

3d2583434816172863243cea777b243b208a49bb

Parent: 1c1f092

Committed by GitHub <noreply@github.com> on 5/11/2026, 11:01:14 AM