Open junyuchen245 opened 1 month ago
Hi @junyuchen245
Thank you for reaching out. Based on the details you provided, it's unclear whether the issue stems from a mismatch between the H100 architecture and the PyTorch version (1.13.1+cu117) in the Docker, resulting in computations defaulting to the CPU.
I just tested the current version of nBEST on an NVIDIA 4090 GPU, and it successfully completed the demo. This suggests that the current setup functions properly with the 4090 (or older) architecture.
As a potential solution, you might consider trying a different GPU compatible with the existing Docker setup. Alternatively, I am prepared to update the PyTorch version (cu118+) in the Docker to accommodate more advanced GPUs.
Thanks.
Understood. I missed the H100 warning somehow, so I thought the error was caused by something else. It would be great if you could update the PyTorch version to support newer architectures. I'll try this on other GPUs as well. Thank you!
Junyu
Hi @TaoZhong11,
Just a quick update that I was able to run nBEST on a different GPU. Thanks so much for your help!
Junyu
Hi @TaoZhong11 ,
Thanks for this amazing work!
I encountered an error while running the Docker image with data I obtained from https://fcon_1000.projects.nitrc.org/indi/PRIMEdownloads.html. To troubleshoot, I tested the demo dataset by keeping only the file
macaque_sub-032144_ses-001_run-1_T1w.nii.gz
in the data/ directory and removing the rest. However, I received the same error message.The only output I got was brain masks, but they were empty (all zero values).
Any suggestions on how to resolve this issue would be much appreciated! Thank you!