Closed by imtiazziko 5 years ago
We have not experienced any issue like your NaN case. Based on your screenshot, it seems that our code has been modified. Could you double-check whether there are any differences from our original code, or provide more information about this issue? Thanks!
Thanks for replying. I did not change your code, though. I think you should be able to see this issue by printing the logs like I did.
@imtiazziko In the original implementation, we also print all the losses, but we have never had this issue in any of our experiments. It also looks like the segmentation loss is really high before going to NaN, which is not normal. Could you provide more details, e.g., the training hyper-parameters and which pre-trained weights are used?
Hello @wasidennis
Also, how did you decide on the early stopping epoch number 149999?
I really cannot reproduce the result of 41.4% reported in the paper (GTA5-Cityscapes). Can you help me with that?
Thank you very much. Looking forward to your answers.