lucasjinreal / DCNv2_latest

DCNv2 supports decent pytorch such as torch 1.5+ (now 1.8+)
BSD 3-Clause "New" or "Revised" License
637 stars 124 forks source link

Error when run testcuda.py #69

Open xuqingyu26 opened 1 year ago

xuqingyu26 commented 1 year ago

The environment is: torch1.11+cu11.3+python3.8+ubuntu22.04

And the project is seccessfully compiled.

When run the testcuda.py ,i run into the following questions: error in modulated_deformable_im2col_cuda: no kernel image is available for execution on the device error in modulated_deformable_col2im_coord_cuda: no kernel image is available for execution on the device error in modulated_deformable_col2im_cuda: no kernel image is available for execution on the device error in modulated_deformable_im2col_cuda: no kernel image is available for execution on the device error in modulated_deformable_col2im_coord_cuda: no kernel image is available for execution on the device error in modulated_deformable_col2im_cuda: no kernel image is available for execution on the device error in modulated_deformable_im2col_cuda: no kernel image is available for execution on the device torch.Size([2, 64, 128, 128]) Traceback (most recent call last): File "testcuda.py", line 285, in example_dpooling() File "testcuda.py", line 239, in example_dpooling dout = dpooling(input, rois, offset) File "/home/user/anaconda3/envs/pytorch1.11.0/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl return forward_call(*input, **kwargs) File "/home/user/PycharmProjects/MOT_Project/1/CenterTrack-master/src/lib/model/networks/DCNv2-pytorch_1.11/dcn_v2.py", line 267, in forward return dcn_v2_pooling( File "/home/user/PycharmProjects/MOT_Project/1/CenterTrack-master/src/lib/model/networks/DCNv2-pytorch_1.11/dcn_v2.py", line 199, in forward output, output_count = _backend.dcn_v2_psroi_pooling_forward( RuntimeError: CUDA error: no kernel image is available for execution on the device CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

Could please help me solve this?