Closed okuta closed 6 years ago
Currently MultiNodeBatchNormalizationFunction calls 4 multiply and 2 add operations. This PR unifies it kernel launch.
MultiNodeBatchNormalizationFunction
Replaced by #282
Currently
MultiNodeBatchNormalizationFunction
calls 4 multiply and 2 add operations. This PR unifies it kernel launch.