Closed yilewang closed 2 years ago
Or, does the eddy_cuda
require specific version of the cuda to run the job here? I am using cuda 11.4 in the TACC
.
Also @noahfl , do you mind sharing your singularity def
file? One of the postdoc from TACC
who is helping technical issue suspects that it's a compute capability problem between the image and TACC
A100 GPU.. She wants to look at it~ Thanks so much!
Hey Yile. It's definitely a compatibility issue with A100 GPUs. The only way to enable CUDA acceleration for FSL's EDDY
, PROBTRACKX
, and BEDPOSTX
is to use CUDA 9.2 (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/GPU). A100 GPUs require CUDA >=10.0; as a result, until the FSL developers provide support for CUDA 10+ for EDDY
, PROBTRACKX
, and BEDPOSTX
we are stuck with using non-Ampere GPUs. P100s are what we use on our HPC system but V100s should work as well. CUDA 9.2 is what's provided in the container because of this.
As for the container definition file, use this command to get a definition file for a container:
singularity inspect -d container-name.sif > container-name.def
I'll put it up in a repo for ease of use once I fix it up some more but this will at least allow you to grab it right now. It also works for grabbing the def file of any container pulled from remote.
Btw, these kinds of questions are better suited for the Discussions page. It's a bit easier for us to answer and keep track of them there :smiley:
Hi Noah! Thanks for the reply! I apologize I didn't post them in the discussion page. I will do it after that~ I will double check with TACC system to see if they have non-ampere GPUs provided. I will close this issue with this comment, and if I have more updates, I will post them in the discussion page~
Thanks again for helping out!
Hi! I have a question about the
eddy_cuda
module... even though I requested the gpu module inTACC
, I still fail to execute theeddy_cuda
... I am wondering do I need to install any additional nvidia driver inside the container to make it run? Thanks!The error message
The SLURM script: