Closed GaryAitken closed 1 year ago
Hi,
This seems to be related to your OS in your Google computing engine. Note that the instructions here will not work will all OS:s. There are many more 'errors' than can occur when setting up GPU instructions for your machine and we can't possibly list all of them. It is up to the user to choose a suitable installation option of the ones we supply (which is why we offer different ways of installation).
This is a part of learning to code/do bioinformatics and we hope that you will find this lesson useful for the future.
Given the rate at which python modules are updated, I understand it's difficult to keep things "current". I recently created a new google compute engine with gpu, A100 40GB w/1 gpu, machine type a2-highgpu-1g (12vCPU, 85GB) I was prompted that the OS config did not support gpu, and asked if I wanted to switch to a supported version. Answering yes, I ended up with what I presume is the "default" config. That was a debian 10 version. Subsequent testing revealed that the debian 10 environment has a ptxas version of 11.0.221, which is
It appeared necessary to select an OS version other than the default, in this case "Debian 11 based Deep Learning VM with M109 and CUDA 11.3" to get an appropriate environment. If this is now a requirement, it would be good to note that and suggest running "ptxas --version" to check. It should look something like this:
Subsequently, the supplied "speed_ppi.yml" created a conda environment as follows:
This resulted in errors when running some_vs_some:
Using trial-and-error and seat-of-the pants guessing, I tweaked speed_ppi.yml to create the following environment which seemed to work:
It appears that at least for some modules, ">=" allows incompatible combinations. numpy in particular; possibly immutabledict and tensorflow-cpu. (I did not try all combinations of numpy, immutabledict and tensorflow-cpu; I stopped when I found the above combination that worked.)