Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Model checkpointing `save_on_train_epoch_end` default behavior documentation (#20931)
* default behavior * clarify some docs (cherry picked from commit ab7b29950fcb090ec3712f1fe788a0ba8605bb7f)
S
Shion Matsumoto committed
c987e064c6155fa623b0a4611a532cc5df6ddd97
Parent: b8bd96f
Committed by Jirka Borovec <[email protected]>
on 8/13/2025, 7:19:50 PM