Open abhiskk opened 7 years ago
While calculating variance for the modified batch normalization the variance which is calculated is done after getting detached from the backprop which breaks the backprop.
While calculating variance for the modified batch normalization the variance which is calculated is done after getting detached from the backprop which breaks the backprop.