keras-team / keras

Deep Learning for humans
http://keras.io/
Apache License 2.0

Big problems with GAN and keras 2.0 because of new batchnorm #5892

Closed engharat closed 7 years ago

engharat commented 7 years ago

Hi, I work with Conditional GANs, and in the last few days I have been working on a Conditional Wasserstein GAN implementation in Keras, starting from this code: https://github.com/tdeboissiere/DeepLearningImplementations/tree/master/WassersteinGAN The problem is that the Conditional WGAN (and GANs and Conditional GANs too) does not produce correct output when batchnorm is not set to batchnorm_mode=2. I tried mode=0 before Keras 2.0, and the images produced by the generator become independent of the noise, so I do not get any variability. So in Keras 2.0 I cannot work with WGANs/GANs anymore, since mode=2 was removed in the new batchnorm implementation. I think this is a very big issue, and I hope the old batchnorm modes will be supported again; otherwise Keras will not work with most recent generative models.


bstriner commented 7 years ago

Lots of issues around this. The real question is, what should BN do for a GAN?

There are a lot of ways I could imagine using BN in a GAN, and I can't be sure which is best until someone tests them out and writes a paper.

Cheers

engharat commented 7 years ago

I think the several options you listed apply to the discriminator, and in fact we still don't know the appropriate way to use BN there - I agree with your considerations. Anyway, the problem here concerns the generator - its BN heavily influences the generated images. And beside the theoretical analyses, the main issue still remains: Keras 2.0 removed a BN mode that is pivotal for the correctness of GAN models, and this should be addressed as soon as possible if we want to see broad adoption of Keras 2.0.

bstriner commented 7 years ago

So you're having issues using BN in the generator? Using BN in the discriminator has tons of considerations, but in the generator it should be fine. Do you have something simple to demo the problem?

engharat commented 7 years ago

I have done some experiments on this issue.

Steps to easily reproduce the problem:

- repeat the experiments with bn_mode=0 (python main.py bn_mode=0) and you will see that the output is garbage: in particular, the 8 output images are always the same (apart from a very, very small variation), while each of them should be different - perhaps this can be explained as a total GAN collapse.

adamcavendish commented 7 years ago

@engharat Do you mean the feature-wise normalization?

    2: feature-wise normalization, like mode 0, but
       using per-batch statistics to normalize the data during both
       testing and training.
engharat commented 7 years ago

Exactly! Feature-wise normalization seems to be crucial for correct GAN output at test time
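To make the difference concrete, here is a minimal NumPy sketch (function names and the epsilon value are mine, not from Keras): the old mode 0 normalizes at test time with the moving averages accumulated during training, whereas mode 2 keeps using the statistics of the current batch.

import numpy as np

def bn_mode0_inference(x, moving_mean, moving_var, gamma, beta, eps=1e-3):
    # Old mode 0 at test time: normalize with the moving averages
    # accumulated during training.
    return gamma * (x - moving_mean) / np.sqrt(moving_var + eps) + beta

def bn_mode2(x, gamma, beta, eps=1e-3):
    # Old mode 2: always normalize with the statistics of the current
    # batch, during both training and testing.
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    return gamma * (x - mean) / np.sqrt(var + eps) + beta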

stale[bot] commented 7 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 30 days if no further activity occurs, but feel free to re-open a closed issue if needed.

mznyc commented 6 years ago

Has this issue been resolved? I ran into exactly the same problem after upgrading to Keras 2.0.

engharat commented 6 years ago

I suppose batchnorm has not received any update, so the problem is not solved. Anyway, it can be solved fairly easily by cloning the BatchNormalization class into a new class, e.g. BatchNormGAN or BatchNormMode2, and changing the following code: return K.in_train_phase(normed, normalize_inference, training=training) to: return K.in_train_phase(normed, normalize_inference, training=True) so that the batchnorm behaviour matches the old BN with mode=2. At least, it worked for me!
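For reference, a minimal sketch of the same idea, subclassing instead of editing a copied class (the name BatchNormGAN is just a placeholder; assumes the Keras 2.x BatchNormalization API):

from keras.layers import BatchNormalization

class BatchNormGAN(BatchNormalization):
    """Drop-in replacement that mimics the old mode=2: always normalize
    with the statistics of the current batch, even at inference time."""
    def call(self, inputs, training=None):
        # Ignore the learning-phase flag and force per-batch statistics.
        return super(BatchNormGAN, self).call(inputs, training=True)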

mznyc commented 6 years ago

Thank you for the quick response. After some digging, it seems the issue is with sharing the discriminator in the GAN, and model.trainable not working as expected. My loss curve also suggests that the generator loss does not decrease. I tried several ways suggested elsewhere, but they didn't work.

engharat commented 6 years ago

I solved the problem of .trainable with the following function:

def make_trainable(net, value):
    # Toggle trainability on the model itself and on every layer inside it.
    net.trainable = value
    for l in net.layers:
        l.trainable = value

Where net is the model you want to freeze or unfreeze, and value is True/False to make it trainable or not.
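For context, a self-contained sketch of how such a helper is typically used when stacking the generator and discriminator (the tiny Dense models, layer sizes, losses, and all names are hypothetical placeholders, not from the issue; uses the make_trainable helper above):

from keras.models import Sequential
from keras.layers import Dense

# Tiny placeholder models, just to illustrate the freezing pattern.
generator = Sequential([Dense(16, activation='relu', input_dim=8), Dense(8)])
discriminator = Sequential([Dense(16, activation='relu', input_dim=8),
                            Dense(1, activation='sigmoid')])

# Compile the discriminator on its own while its weights are trainable.
make_trainable(discriminator, True)
discriminator.compile(loss='binary_crossentropy', optimizer='adam')

# Freeze the discriminator before compiling the stacked model, so that
# only the generator is updated when training on the combined output.
make_trainable(discriminator, False)
combined = Sequential([generator, discriminator])
combined.compile(loss='binary_crossentropy', optimizer='adam')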

mznyc commented 6 years ago

[image: loss curve]

Here is my loss curve. I have tried pretty much everything and it is still not able to converge. This is a WGAN example.

engharat commented 6 years ago

We are going off-topic, but anyway: I tried my proposed batchnorm and my proposed .trainable function on a WGAN and it worked flawlessly. The loss does not have to converge to zero; I think 0.70 and 0.60 are quite reasonable. I invite you to try again, look at the produced output, and if it still fails, look for errors somewhere else in your code, because this approach is actually working for me (and for several GAN researchers, I suppose ;)

mznyc commented 6 years ago

Appreciate your response. My problem here is that during pre-training the discriminator does not function as expected: the accuracy is only roughly 50%, so there is something wrong with the architecture.

mznyc commented 6 years ago

Finally identified the problem as model insufficiency and unnecessary BatchNormalization. I removed BatchNormalization, which turned out to be unnecessary anyway. Here is the result from WGAN-GP. I did not use make_trainable; I simply set/reset the trainable property. Very nice and fast convergence with GP.

[image: WGAN-GP result]
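For completeness, a minimal sketch of the "set/reset trainable property" pattern mentioned above (model names are placeholders, defined as in the earlier sketch; it relies on the trainable flag being captured when a model is compiled in Keras 2.x):

# Compile the discriminator while it is trainable; this compiled model
# keeps updating the discriminator weights when trained on its own.
discriminator.trainable = True
discriminator.compile(loss='binary_crossentropy', optimizer='adam')

# Freeze it, then compile the stacked model; inside `combined` the
# discriminator weights stay fixed and only the generator is updated.
discriminator.trainable = False
combined.compile(loss='binary_crossentropy', optimizer='adam')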