Closed chaitjo closed 6 years ago
They are not the same. LayerNorm paper: Layer normalization, https://arxiv.org/abs/1607.06450 BatchNorm paper: Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, https://arxiv.org/abs/1502.03167
I see, thank you for the information!
Keras implements a BatchNormalization layer. Isn't the LayerNormalization class the same thing?
Ref: https://keras.io/layers/normalization/
(Or is the code for a version of Keras where BN was not implemented?)