Open power76 opened 4 years ago
tensorflow version is required to be 1.15 now. What are the other new errors ?
tensorflow version is required to be 1.15 now. What are the other new errors ?
I got confused totally.
I have successfully run the model "faster_rcnn_resnet101_pets" training on GCP. I thought the error above came from the windows OS probably. After I followed the instruction on Linux platform. All errors have gone. But when I try the model "ssd_mobilenet_v1_coco",the errors remained as above like:
master-replica-0 Command '['python', '-m', u'object_detection.model_main', u'--model_dir=gs://mymodel_bucket/model_dir', u'--pipeline_config_path=gs://mymodel_bucket/data/ssd_mobilenet_v1_pets.config', '--job-dir', u'gs://mymodel_bucket/model_dir']' returned non-zero exit status 1.
I am still struggling with this error. Any solutions ?
Prerequisites
Please answer the following questions for yourself before submitting an issue.
1. The entire URL of the file you are using
https://github.com/tensorflow/models/tree/master/research/...
2. Describe the bug
A clear and concise description of what the bug is. When I run the model training on Google Cloud as the instruction. There comes the error: The replica ps 0 exited with a non-zero status of 1. Termination reason: Error. Traceback (most recent call last): File "/usr/lib/python2.7/runpy.py", line 174, in _run_module_as_main "main", fname, loader, pkg_name) File "/usr/lib/python2.7/runpy.py", line 72, in _run_code exec code in run_globals File "/root/.local/lib/python2.7/site-packages/object_detection/model_main.py", line 23, in
import tensorflow.compat.v1 as tf
ImportError: No module named v1
I have followed all the steps of the repository but can't solve the problem except changing the Google Cloud Runtime-version to 1.15. But it will introduce some new errors.
3. Steps to reproduce
Steps to reproduce the behavior.
4. Expected behavior
A clear and concise description of what you expected to happen.
5. Additional context
Include any logs that would be helpful to diagnose the problem.
6. System information