Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Utility to consolidate sharded checkpoints (#19213)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
A
awaelchli committed
b1127e360810152da7b8c4e66e7e0d619a7c8aac
Parent: ed367ca
Committed by GitHub <noreply@github.com>
on 1/23/2024, 10:15:22 PM