RuntimeError: CUDNN_STATUS_EXECUTION_FAILED

Issue description

demo.py fails to run with the error below

RuntimeError: CUDNN_STATUS_EXECUTION_FAILED

Code example

Command to reproduce the bug:

Remove torch and torchvision in nvcr.io/nvidia/pytorch:19.11-py3
```
root@501243bba88b:/swook# pip uninstall torch torchvision
```

Install torch==0.3.1 complied with CUDA_VERSION = 9000

root@501243bba88b:/swook# conda install https://anaconda.org/pytorch/pytorch/0.3.1/download/linux-64/pytorch-0.3.1-py36_cuda9.1.85_cudnn7.0.5_2.tar.bz2

We can find torch==0.3.1 for other CUDA from here

Install torchvision==0.2.1

root@501243bba88b:/swook# pip install torchvision==0.2.1

Inspect containers

(py36torch14) swook@durian:/data/swook/download$ docker ps

Commit a container for nvcr.io/nvidia/pytorch:19.11-py3

(py36torch14) swook@durian:/data/swook/download$ docker commit fc5d3760e589 swook/torch031:19.11-py3

Run a demo in a swook/torch031:19.11-py3

root@501243bba88b:/swook/repos/tencent/dsfd# python demo.py --trained_model /swook/model/dsfd/WIDERFace_DSFD_RES152.pth --widerface_root /swook/dataset/wider-face/WIDER_val/ --save_folder ./save --visual_threshold 0.1 --cuda CUDA

Error messages:

RuntimeError: CUDNN_STATUS_EXECUTION_FAILED

Whole stack traces:

Traceback (most recent call last):
  File "demo.py", line 207, in <module>
    test_oneimage()
  File "demo.py", line 172, in test_oneimage
    det0 = infer(net , img , transform , thresh , cuda , shrink)
  File "demo.py", line 72, in infer
    y = net(x)      # forward pass
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/module.py", line 357, in __call__
    result = self.forward(*input, **kwargs)
  File "/swook/repos/tencent/dsfd/face_ssd.py", line 238, in forward
    conv3_3_x = self.layer1(x)
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/module.py", line 357, in __call__
    result = self.forward(*input, **kwargs)
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/container.py", line 67, in forward
    input = module(input)
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/module.py", line 357, in __call__
    result = self.forward(*input, **kwargs)
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/conv.py", line 282, in forward
    self.padding, self.dilation, self.groups)
  File "/opt/conda/lib/python3.6/site-packages/torch/nn/functional.py", line 90, in conv2d
    return f(input, weight, bias)

System Info

PyTorch or Caffe2: PyTorch
How you installed PyTorch (conda, pip, source): docker (nvcr.io/nvidia/pytorch)
Build command you used (if compiling from source): None
OS: Ubuntu 16.04 LTS
PyTorch version: 0.3.1.post2
Python version: 3.6
CUDA/cuDNN version: 10.2
GPU models and configuration: 2080 Ti
GCC version (if compiling from source): None
CMake version: None
Versions of any other relevant libraries: None

swoook / dsfd

RuntimeError: CUDNN_STATUS_EXECUTION_FAILED #7

Issue description

Code example

System Info