Open CrossEntropy opened 4 years ago
When I use tesnorflow2.0
, I set the BatchSize
to 128
, although the nan
will appear, the model still recycles the face. This is really amazing. ToT
I suspect it may be a problem with the score function processing method. As you described in your paper, there are three methods:
(1) variance (2) 1x1 convolution (3) uniform.
I think the method of variance can reduce the amount of parameters, so I choice it. Looking forward to your reply!
Hello @CrossEntropy,
It's been a long time since I ran this repo. My suggestion is use smaller batch like 32 or 16, and use a lower version of Tensorflow and Keras since they have updated it recently.
https://github.com/tensorflow/tensorflow/issues/3290 https://github.com/tensorflow/tensorflow/issues/8101 It seems like tf.nn.moments could possibly return nan. You may pick out the nana from the variance and put zero back in. I assume this would solve the issue.
Hi, @shamangary ! I got the following error while training the model
FSA_net_Var_Capsules
And the same phenomenon also appeared in the model I built myself, my model only replaced the
ssr_G_model_build
part. Thanks for your help!