Describe the problem clearly here. Be sure to convey here why it's a bug in TensorFlow or a feature request.
In the training process, after about 270000 iterations, the error occured. And I could restart the training from the checkpoints. However it happened again at 290000 iteration and again at 310000 iteration. It seems it happens about every 20k iterations. However, I didn't have this kind of errors before when I was training other models using object_detection or in the previous 270000 iterations.
Update:
Now , it fails to continue training, throwing another error. I don't know if this is related to the first error.
self._prewrite_check()
File "/usr/lib/python2.7/site-packages/tensorflow/python/lib/io/file_io.py", line 82, in _prewrite_check
compat.as_bytes(self.__name), compat.as_bytes(self.__mode), status)
File "/usr/lib64/python2.7/contextlib.py", line 24, in __exit__
self.gen.next()
File "/usr/lib/python2.7/site-packages/tensorflow/python/framework/errors_impl.py", line 466, in raise_exception_on_not_ok_status
pywrap_tensorflow.TF_GetCode(status))
tensorflow.python.framework.errors_impl.FailedPreconditionError: object_detection/models/model_pedes_new/train/checkpoint.tmp9a60b55633944de6ad4f1fbeceba829a
Closing this issue due to staleness. Please use the latest version of TensorFlow and test again. Feel free to reopen if the issue still persists. Thanks!
System information
python object_detection/train.py --logtostderr --pipeline_config_path=object_detection/models/model/rfcn_resnet101_pedestrain.config --train_dir=object_detection/models/model/train
Describe the problem
Describe the problem clearly here. Be sure to convey here why it's a bug in TensorFlow or a feature request.
In the training process, after about 270000 iterations, the error occured. And I could restart the training from the checkpoints. However it happened again at 290000 iteration and again at 310000 iteration. It seems it happens about every 20k iterations. However, I didn't have this kind of errors before when I was training other models using object_detection or in the previous 270000 iterations.
Update: Now , it fails to continue training, throwing another error. I don't know if this is related to the first error.