Closed: Callidior closed this issue 5 years ago
I discovered that the problem was due to the implementation of the freezable BatchNormalization layer. If initialized with freeze=False, it is not equivalent to the standard implementation, but is always forced into training mode.
I propose to fix this in PR #47.
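To make the difference concrete, here is a minimal pure-Python sketch of the behaviour described above. It is not the keras_resnet code and not the change made in PR #47: ToyBatchNorm, FreezableBatchNorm and the always_train_when_unfrozen switch are hypothetical names introduced only for illustration, and the toy layer works on a flat list of floats without learned scale or shift.

```python
import math


class ToyBatchNorm:
    """Toy batch norm over a flat list of floats (hypothetical, no scale/shift)."""

    def __init__(self, momentum=0.9, epsilon=1e-5):
        self.momentum = momentum
        self.epsilon = epsilon
        self.moving_mean = 0.0
        self.moving_var = 1.0

    def call(self, xs, training=False):
        if training:
            # Training mode: normalize with batch statistics and
            # update the moving averages.
            mean = sum(xs) / len(xs)
            var = sum((x - mean) ** 2 for x in xs) / len(xs)
            m = self.momentum
            self.moving_mean = m * self.moving_mean + (1 - m) * mean
            self.moving_var = m * self.moving_var + (1 - m) * var
        else:
            # Inference mode: normalize with the stored moving averages.
            mean, var = self.moving_mean, self.moving_var
        return [(x - mean) / math.sqrt(var + self.epsilon) for x in xs]


class FreezableBatchNorm(ToyBatchNorm):
    """Assumed shape of a freezable layer; the flag below is illustrative only."""

    def __init__(self, freeze, always_train_when_unfrozen=False, **kwargs):
        super().__init__(**kwargs)
        self.freeze = freeze
        self.always_train_when_unfrozen = always_train_when_unfrozen

    def call(self, xs, training=False):
        if self.freeze:
            # Frozen: always use the moving averages.
            training = False
        elif self.always_train_when_unfrozen:
            # Reported behaviour: the caller's training flag is ignored.
            training = True
        # Otherwise defer to the caller, like the standard layer does.
        return super().call(xs, training=training)


batch = [1.0, 3.0]
reported = FreezableBatchNorm(freeze=False, always_train_when_unfrozen=True)
expected = FreezableBatchNorm(freeze=False)

# Both layers are called in inference mode, as during validation.
reported_out = reported.call(batch, training=False)
expected_out = expected.call(batch, training=False)
standard_out = ToyBatchNorm().call(batch, training=False)
```

In this sketch, the layer that respects the caller's flag matches the plain layer in inference mode, while the layer forced into training mode normalizes the validation batch with its own statistics and also shifts its moving averages. That would be one plausible way for validation accuracy to suffer even when training itself looks fine.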
Fixed in #47.
I previously had no problems training the ResNet50 implementation bundled with Keras in keras.applications.resnet50 to 70% validation accuracy on the ILSVRC 2012 dataset. Now I wanted to switch to the keras_resnet implementation, but was not able to get validation accuracy above 30%. Right after the first epoch, the accuracy of keras_resnet is about 1%, while the bundled ResNet50 already achieves 11%.
I am creating the ResNet like this:
I've already tried different learning rate schedules and optimizers, but nothing worked.
Is there anything special I have to take care of?