rggs closed this issue 4 years ago
Same here.
I'm putting this here because this fixed it for me: https://github.com/tensorflow/models/issues/8423#issuecomment-620188942
The labels in your .pbtxt may differ from the labels in your .csv files. Check that every label matches exactly, including capitalization.
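For anyone who wants to check this without eyeballing the files, here is a rough sketch of the comparison. It assumes the label map uses `name: '...'` entries and that the annotation CSV has a `class` column (the layout produced by the commonly used xml_to_csv script). Both are assumptions, so adjust them to your data. The function names and the sample data are made up for illustration; in practice you would read the text from your own files.

```python
import csv
import io
import re


def labels_from_pbtxt(text):
    """Return the set of class names found in label-map text."""
    # Matches entries such as  name: 'cat'  or  name: "cat"
    # (display_name is not matched because of the word boundary).
    return set(re.findall(r"""\bname\s*:\s*['"]([^'"]+)['"]""", text))


def labels_from_csv(text, column="class"):
    """Return the set of values in the label column of annotation CSV text."""
    # The column name "class" is an assumption about your CSV layout.
    return {row[column] for row in csv.DictReader(io.StringIO(text))}


def compare_labels(pbtxt_text, csv_text, column="class"):
    """Return (labels only in the CSV, labels only in the label map)."""
    in_map = labels_from_pbtxt(pbtxt_text)
    in_csv = labels_from_csv(csv_text, column)
    return sorted(in_csv - in_map), sorted(in_map - in_csv)


# Made-up sample data: 'Cat' and 'cat' differ only in capitalization.
SAMPLE_PBTXT = """
item {
  id: 1
  name: 'Cat'
}
item {
  id: 2
  name: 'dog'
}
"""

SAMPLE_CSV = """filename,width,height,class,xmin,ymin,xmax,ymax
a.jpg,640,480,cat,10,20,100,200
b.jpg,640,480,dog,15,25,110,210
"""

only_csv, only_pbtxt = compare_labels(SAMPLE_PBTXT, SAMPLE_CSV)
print("In CSV but not in label map:", only_csv)
print("In label map but not in CSV:", only_pbtxt)
```

If both lists come back empty, the label names agree and the problem is probably somewhere else.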
Training runs normally for a while, but then the loss suddenly explodes to values exceeding 1e19. Has anyone had this issue? When it happens, the loss occasionally falls back down to ~0.05, only to shoot back up. Here is an example of the console output:
What is going on here? I'm using the faster_rcnn_inception_v2_coco model. I should add that I've seen similar issues where the number of classes is wrong in the config file, but that doesn't seem to be the case here. As far as I can tell, my number of classes is correct.
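A quick way to double-check the class count is to compare `num_classes` in the pipeline config against the number of `item` blocks in the label map. The sketch below does that with plain regular expressions rather than the Object Detection API's own parsers, so it assumes a simple, conventional file layout. The function names and sample text are hypothetical.

```python
import re


def count_label_map_items(pbtxt_text):
    """Count the item { ... } blocks in label-map text."""
    return len(re.findall(r"\bitem\s*\{", pbtxt_text))


def num_classes_in_config(config_text):
    """Return the first num_classes value in the config text, or None."""
    match = re.search(r"\bnum_classes\s*:\s*(\d+)", config_text)
    return int(match.group(1)) if match else None


# Made-up sample text; replace with the contents of your own files.
SAMPLE_CONFIG = """
model {
  faster_rcnn {
    num_classes: 3
  }
}
"""

SAMPLE_LABEL_MAP = """
item {
  id: 1
  name: 'cat'
}
item {
  id: 2
  name: 'dog'
}
"""

configured = num_classes_in_config(SAMPLE_CONFIG)
n_items = count_label_map_items(SAMPLE_LABEL_MAP)

if configured == n_items:
    print("num_classes matches the label map:", configured)
else:
    print("Mismatch: config says", configured, "but label map has", n_items)
```

A match here only rules out the class-count mismatch; it says nothing about other possible causes of the exploding loss.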