Open digital-thinking opened 4 years ago
I've also run into the same problem. I had no other way out but to revert to 0.0.4. Perhaps there's a bug in the new version of the pre-trained weights.
I thought I had this problem, then discovered it was simply my model setup. I'm training successfully on version 1.1.0.
Can you show which setup you are using now? 🙏
@MarcRoigCampos Please forgive the untidiness of this, it's being used for a class assignment and wasn't really meant to be seen. https://gist.github.com/latrare/773128d9a5f15e31854ea196b414b89a
Note that I am not using the pre-defined weights here, so that may be where the issue lies if your model still does not train after you adjust your program.
@latrare Thanks a lot!!
I have just upgraded from TensorFlow 1 with efficientnet 0.0.4 to the TensorFlow 2 version of the model. The model looks identical (apart from the tensor names), the parameters are the same, and the training data is the same. However, with TensorFlow 2.0 the model no longer learns. I have tried TensorFlow 2.0 with CUDA 10.0 and TensorFlow 2.1 with CUDA 10.1, both with the same result. I wonder if anyone else has noticed similar issues. With TF1 and the 0.0.4 model everything works fine. Training is also slower than before (RTX 2080 Ti).
TF2 (efficientnet 1.0.0):
TF1 (efficientnet 0.0.4):
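
Since the reports in this thread differ by package version (0.0.4, 1.0.0, 1.1.0, and TensorFlow 1.x/2.0/2.1), it may help to include the exact installed versions when comparing setups. The following is only a minimal sketch using the Python standard library (3.8+); the helper name `installed_versions` and the package list are illustrative assumptions, not part of efficientnet or TensorFlow. Note that TensorFlow may be installed under a different distribution name (for example `tensorflow-gpu`), so adjust the list to your environment.

```python
from importlib import metadata


def installed_versions(packages):
    """Return a dict mapping each package name to its installed
    version string, or None if the package is not installed.

    Hypothetical helper for illustration only.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Package names are assumptions; adjust to what your environment uses.
    report = installed_versions(["tensorflow", "tensorflow-gpu", "efficientnet", "keras"])
    for name, version in report.items():
        print(f"{name}: {version or 'not installed'}")
```

Pasting this output alongside a report makes it easier to tell whether two people are really running the same combination of efficientnet and TensorFlow.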