Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Make asyncio checkpointing work if validate/fit is called more than once (#20952)
* Make asyncio checkpointing work if validate/fit is called more than once.
* Apply suggestions from code review
* Add assertion to ensure executor is initialized before saving checkpoint
* update

---------

Co-authored-by: Jirka Borovec <[email protected]>
Co-authored-by: Jirka B <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: bhimrazy <[email protected]>

(cherry picked from commit ff64a92624b949731a2210ad27fc20959d5a5a01)
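
The kind of failure this commit message describes can be sketched as follows. This is a minimal illustration under stated assumptions, not the code changed in this commit: `ReusableAsyncSaver`, `save`, and `teardown` are hypothetical names, and the sketch only assumes that checkpoints are written on a background executor that is shut down at the end of each fit/validate call. If the executor is created once and never recreated, a second call would submit work to an executor that has already been shut down; creating it lazily and asserting that it exists before saving avoids that.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Optional


class ReusableAsyncSaver:
    """Hypothetical helper that runs save functions on a background thread."""

    def __init__(self) -> None:
        # The executor is created lazily so it can be rebuilt after teardown().
        self._executor: Optional[ThreadPoolExecutor] = None
        self._error: Optional[BaseException] = None

    def _ensure_executor(self) -> None:
        # Recreate the executor if a previous teardown() shut it down.
        if self._executor is None:
            self._executor = ThreadPoolExecutor(max_workers=1)

    def save(self, save_fn: Callable[..., Any], *args: Any) -> None:
        self._ensure_executor()
        assert self._executor is not None, "executor must be initialized before saving"

        def _run() -> None:
            # Remember the first failure so teardown() can surface it.
            try:
                save_fn(*args)
            except BaseException as exc:
                if self._error is None:
                    self._error = exc

        self._executor.submit(_run)

    def teardown(self) -> None:
        # Wait for pending saves, then drop the executor so a later
        # fit/validate call starts with a fresh one.
        if self._executor is not None:
            self._executor.shutdown(wait=True)
            self._executor = None
        if self._error is not None:
            error, self._error = self._error, None
            raise error


# Simulate two consecutive runs (e.g. fit followed by validate).
saved = []
saver = ReusableAsyncSaver()
saver.save(saved.append, "fit-1")
saver.teardown()
saver.save(saved.append, "validate-1")
saver.teardown()
```

With a single worker thread, saves complete in submission order, and the second `save` succeeds because the executor is rebuilt on demand rather than reused after shutdown.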
jj hunt committed
6a9d1101e8af4476cc3ee99a9dcf74a90ea31c90
Parent: 2b23b2b
Committed by Luca Antiga <[email protected]>
on 8/29/2025, 10:07:57 AM