Open wboykinm opened 6 years ago
UPDATE: I bumped the driver to the [apparently] current version, and it threw the same error as above.
Hey @wboykinm ! It's been a while since I last used that script for deploying this container, so I'm afraid it's pretty much outdated. My recommendation right now would be to create an instance based on one of the AMIs provided by NVIDIA, which already comes prepared with the appropriate drivers and nvidia-toolkit versions.
I use the AMI named "NVIDIA CUDA Toolkit 7.5 on Amazon Linux" an that one works pretty well, the only thing you need to manually install after creating the instance would be docker and nvidia-docker. After that you should be ready to run the container!
Following the remote-launch outline laid out in @albarji's blog post . . .
p2.xlarge
server with Ubuntu 16.04.2 LTS (GNU/Linux 4.4.0-1020-aws x86_64). . . I get this:
This seems like a driver mismatch. I'm unable to test this locally, unfortunately (wrong GPU), so I'm left to guess if the image needs rebuilding or if I need to change my EC2 config somehow. It looks like the appropriate driver version needs a bump.