Closed levmckinney closed 1 year ago
There's probably some way to use update-alternatives
to make python3.9
the default interpreter although using python3.9
explicitly is OK too.
The Dockerfile LGTM but I have limited context on this project so will defer to @norabelrose on final review.
What this pull request addresses
GPU training on the current docker file does not work due to an incompatibility between the pytorch and cuda versions
What I did
In order to use
torch=1.13.1
you need to usecuda 11.6
the only docker file nvidia provides for this is based onUbuntu 20.04
which ships withpython3.8
. So to satisfy all the requirements for this I had to use a separate ppa to installpython3.9
andpip
. Note that within the docker file you specifically need to run the code using thepython3.9
command and not thepyhton3
orpython
. In addition,pip
should always be used as a module sopython3.9 -m pip install <something>
.Another solution to this problem is to upgrade to support pytorch 2.0 #14.