Closed basitanees closed 3 years ago
You could try with CUDA=10.0, cudnn=7.6.4, tensorflow-gpu==1.12.0.
Thank you for your feedback. It was actually related to CPU RAM. Increasing the RAM solved the issue while using your mentioned cuda and tensorflow versions.
Hi,
I am using 2 Tesla v100s. My program gets killed when using mode than 1 gpu.
When using 8 Tesla_t4, it gets killed as well (with some memory errors which were removed by allowing growth and limiting memory fraction)
My guess is it could be due to incompatibility issues. Which version of CUDA, cuDNN and NCCL versions are you using for your implementation? I am currently using:
Any help would be appreciated.
Regards,