Closed CamilleChiquet closed 4 years ago
If channels first, then batch normalization should be on the axis 1 (channel axis). "axis=-1" is in the case where channels are on the last dimension (batch, width, height, channels).
If channels first, then batch normalization should be on the axis 1 (channel axis). "axis=-1" is in the case where channels are on the last dimension (batch, width, height, channels).