Closed naayem closed 2 years ago
HI @naayem , thanks for reporting the issue.
The cuDNN error: CUDNN_STATUS_INTERNAL_ERROR
is a general error which might related to cuda or cudnn installation.
I saw you used your own data here, did you tried the tutorial (BTCV) dataset and default parameters? You can use BTCV to test whether it's the cudaor cudnn problem.
So I did as you suggested and training is running now on BTCV data. I didn't change anything except the input data and json file.
It works, that's a good. Then the problem might related to the parameters, you could check the number of output channels matches the loss function based on your own data, since the error is raised on the loss.backward.
Yes you're right, I didn't take the time to check all the parameters I should have changed for my new dataset. Then the 'RuntimeError: cuDNN error: CUDNN_STATUS_INTERNAL_ERROR' is solved thank you!
Hello, I git cloned the repository into my server. I created a conda environment with python= 3.7.7 In the directory /home/naayem/Projects/swin/research-contributions/SwinUNETR/BTCV I installed the requirements into the environment with
pip install -r requirements.txt
I then run the command:and I get the error below. Am I the only one to get this kind of error. I didn't find any similar error in the issues. And I am perplex with what seems to me a difficult debugging.
Environment (please complete the following information):
My system: Python 3.7.7 (default, Sep 22 2022, 13:53:33) [GCC 9.3.0] on linux
GPU models and configuration
NVIDIA-SMI 455.23.05 Driver Version: 455.23.05 CUDA Version: 11.1 2 NVIDIA V100 PCIe 32 GB GPUs (2×7TFLOPS)