Open anseey opened 7 years ago
It may be the bug of the script for distributed training with latest TensorFlow.
I will refactor the code for distributed soon. If you want to run distributed TensorFlow application, please try tobegit3hub/distributed_tensorflow which is much better now.
@tobegit3hub Thank you! I have tried tobegit3hub/distributed_tensorflow, it works! But it still has the problem https://github.com/tensorflow/tensorflow/issues/5110
Yes, that's something we're working for now.
distributed/cancer_classifier.py works in only one docker container.
It works in one container:
But it not work in two containers:
the error msg I got in the worker: