Making large AI models cheaper, faster and more accessible
TAGS
20 tags v0.3.1
[release] update version (#4332) * [release] update version * [devops] hotfix cuda extension building * [devops] pytest ignore useless folders
v0.1.12
[zero] add L2 gradient clipping for ZeRO (#2112) * [zero] add L2 gradient clipping * [testing] add MlpModel * [zero] add unit test for grad clipping * fix atol
v0.1.11rc5
[release] update to 0.1.11rc5 (#2053)
v0.1.11rc4
[workflow] fixed the python and cpu arch mismatch (#2010)
v0.1.11rc3
[release] update version (#1931)
v0.1.11rc2
[doc] polish diffusion README (#1840)
v0.1.11rc1
[hotfix] resharding cost issue (#1742)