tensorflow / benchmarks

A benchmark framework for Tensorflow
Apache License 2.0
1.15k stars 634 forks source link

To solve horovod's execution error of multi-GPU with multi-server #450

Closed jayhpark530 closed 4 years ago

jayhpark530 commented 4 years ago

Fix GPU allocation error when learning from multi-GPU with multi-server.

reedwm commented 4 years ago

Since I do not know how to use Horovod and tf_cnn_benchmarks is unmaintained, I cannot verify if this PR works, so I unfortunately cannot accept it :(. In general, I can no longer address Horovod issues. I apologize for not being able to accept this PR.