Closed xytintel closed 1 month ago
Resolve issue: https://github.com/pytorch/pytorch/issues/136007
@dvrogozh Double-checked: of the 3 kernels above, only BatchNormBackwardReduceChannelsLastKernelFunctor needs a barrier for its reduction.
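For context on why a reduction needs a barrier, here is a minimal sketch in plain Python. It is not the code of BatchNormBackwardReduceChannelsLastKernelFunctor; it only assumes a generic tree reduction over shared memory. The names `group_reduce_sum`, `work_item`, and `shared` are hypothetical, and `threading.Barrier` stands in for a work-group barrier.

```python
import threading


def group_reduce_sum(values):
    """Tree-reduce `values` with one thread per element (illustrative only)."""
    n = len(values)
    assert n > 0 and n & (n - 1) == 0, "sketch assumes a power-of-two group size"

    shared = list(values)            # stands in for work-group local memory
    barrier = threading.Barrier(n)   # stands in for a work-group barrier

    def work_item(lid):
        stride = n // 2
        while stride > 0:
            if lid < stride:
                # Combine this slot with the partner slot `stride` away.
                shared[lid] += shared[lid + stride]
            # Every work-item waits here, including idle ones, so the partial
            # sums written in this step are complete before the next step reads them.
            barrier.wait()
            stride //= 2

    threads = [threading.Thread(target=work_item, args=(i,)) for i in range(n)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return shared[0]


if __name__ == "__main__":
    print(group_reduce_sum([1, 2, 3, 4, 5, 6, 7, 8]))
```

Without the `barrier.wait()` call, a work-item could start the next step and read a partner slot that has not been updated yet, which is the kind of race a barrier in a reduction kernel guards against.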
BatchNormBackwardReduceChannelsLastKernelFunctor