Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Add `EMAWeightAveraging` callback to `weight_averaging.py` (#21260)
A
Alex Morehead committed
9bcba1c1e82b45e10f948dc28fc12f4cf04ab736
Parent: 126fa6f
Committed by GitHub <[email protected]>
on 11/19/2025, 1:38:00 PM