Closed Bai-YunHan closed 4 years ago
One more thing i’d like to add, The 1080Ti works fine on my other deep learning job.
Thanks in advance, looking forward to your reply.
FYI, I tried to train on a GTX Titan X(pascal) this morning, gave me the same error: RuntimeError: CUDA error: no kernel image is available for execution on the device
Updates...
If I re-build MMDetection, and only allow it to see 1080Ti, i.e. CUDA_VISIBLE_DEVICES=1 pip install -v -e . Then training with 1080Ti works, but training with 2080Ti fail with "RuntimeError: CUDA error: no kernel image is available for execution on the device”
If I re-build MMDetection, and only allow it to see 2080Ti, i.e. CUDA_VISIBLE_DEVICES=0 pip install -v -e . Then training with 2080Ti works, but training with 1080Ti fail with "RuntimeError: CUDA error: no kernel image is available for execution on the device”
Is there anyway to build mmdetection to be compatible to all existing GPUs?
same problem with P40 and V100
One recommended solution is to build mmdetection on the specific GPU.
Or you can specify the environment variable TORCH_CUDA_ARCH_LIST
to include both your GPU arches.
Solved with TORCH_CUDA_ARCH_LIST
, many thanks.
You suggested method works !
"specify the environment variable TORCH_CUDA_ARCH_LIST
to include both your GPU arches.”
Thanks !
@Bai-YunHan Hello, i have the same problem. for setting the environment variable TORCH_CUDA_ARCH_LIST
, would you tell me what's the exact command should run when build mmdet? Thanks
@TWDH yes, TORCH_CUDA_ARCH_LIST="6.1;7.5" python -m pip install -v -e .
Thanks for your error report and we appreciate it a lot.
Checklist
Reproduction
Environment
python mmdet/utils/collect_env.py
to collect necessary environment infomation and paste it here. sys.platform: linux Python: 3.7.7 (default, May 7 2020, 21:25:33) [GCC 7.3.0] CUDA available: True CUDA_HOME: /usr/local/cuda NVCC: Cuda compilation tools, release 10.2, V10.2.89 GPU 0: GeForce RTX 2080 Ti GPU 1: GeForce GTX 1080 Ti GCC: gcc (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0 PyTorch: 1.5.0 PyTorch compiling details: PyTorch built with:TorchVision: 0.6.0a0+82fd1c8 OpenCV: 4.2.0 MMCV: 0.6.2 MMDetection: 2.2.0+741b638 MMDetection Compiler: GCC 7.5 MMDetection CUDA Compiler: 10.2
$PATH
,$LD_LIBRARY_PATH
,$PYTHONPATH
, etc.) Not likely.Error traceback If applicable, paste the error trackback here.
Bug fix If you have already identified the reason, you can provide the information here. If you are willing to create a PR to fix it, please also leave a comment here and that would be much appreciated!