Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Add broadcast to Dataset Optimizer with multiple nodes (#18860)
Co-authored-by: Luca Antiga <[email protected]>
Co-authored-by: thomas <[email protected]>
thomas chaton committed
0843041d1d1d0a4e3b0a0aebd3980b002aab8c4e
Parent: 182c30b
Committed by GitHub <[email protected]>
on 10/26/2023, 11:42:46 PM