Closed jpcenteno80 closed 3 years ago
We tested the code on Ubuntu 18.04 with Cuda 10.1 and libcudnn 7.6.
About the error message, this is beyond our code. We did not make any changes to the Keras libs.
I had the same issue and resolved it by modifying some python scripts, see the answer from Michele Bianco at https://stackoverflow.com/questions/52950449/valueerror-when-using-multi-gpu-model-in-keras. Runs fine for me :)
Sorry to bug you again, but with a 4 GPU instance, I ran into this error when executing
python run_step1_heart_localization.py
:So I had to change line 171 of
venv/lib/python2.7/site-packages/tensorflow/python/keras/utils/multi_gpu_utils.py
. From:target_devices = ['/cpu:0'] + ['/gpu:%d' % i for i in target_gpu_ids]
To:target_devices = ['/cpu:0'] + ['/xla_gpu:%d' % i for i in target_gpu_ids]
This is what I am running on Nvidia:
That fixed the issue.