facebookresearch / fairscale

PyTorch extensions for high performance and large scale training.
3.18k stars, 280 forks

changes to keep reduced grad in fp32 #1152

Closed vedanuj closed 11 months ago