ROCm / apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
BSD 3-Clause "New" or "Revised" License
19 stars 17 forks source link

delete the redundant instructions & calculation of Welford online algorithm in fused_layer_norm #49

Closed alexshuang closed 3 years ago