tensorflow / models

Models and examples built with TensorFlow
Other
77.16k stars 45.76k forks source link

tensorflow.python.framework.errors_impl.AlreadyExistsError #2063

Closed lionel92 closed 6 years ago

lionel92 commented 7 years ago

System information

python object_detection/train.py --logtostderr --pipeline_config_path=object_detection/models/model/rfcn_resnet101_pedestrain.config --train_dir=object_detection/models/model/train

Describe the problem

Describe the problem clearly here. Be sure to convey here why it's a bug in TensorFlow or a feature request.

In the training process, after about 270000 iterations, the error occured. And I could restart the training from the checkpoints. However it happened again at 290000 iteration and again at 310000 iteration. It seems it happens about every 20k iterations. However, I didn't have this kind of errors before when I was training other models using object_detection or in the previous 270000 iterations.

 pywrap_tensorflow.TF_GetCode(status))
tensorflow.python.framework.errors_impl.AlreadyExistsError: object_detection/models/model/train/checkpoint.tmp34ad7bc3cd2a4711ad0092ab5b599a50

Update: Now , it fails to continue training, throwing another error. I don't know if this is related to the first error.

   self._prewrite_check()
  File "/usr/lib/python2.7/site-packages/tensorflow/python/lib/io/file_io.py", line 82, in _prewrite_check
    compat.as_bytes(self.__name), compat.as_bytes(self.__mode), status)
  File "/usr/lib64/python2.7/contextlib.py", line 24, in __exit__
    self.gen.next()
  File "/usr/lib/python2.7/site-packages/tensorflow/python/framework/errors_impl.py", line 466, in raise_exception_on_not_ok_status
    pywrap_tensorflow.TF_GetCode(status))
tensorflow.python.framework.errors_impl.FailedPreconditionError: object_detection/models/model_pedes_new/train/checkpoint.tmp9a60b55633944de6ad4f1fbeceba829a
ymodak commented 6 years ago

Closing this issue due to staleness. Please use the latest version of TensorFlow and test again. Feel free to reopen if the issue still persists. Thanks!