Follow Colab tutorial closely, except that I installed detectron2 from local repo with python -m pip install -e .
Running output
Exception during training:
Traceback (most recent call last):
File "/data/yang/detectron2/detectron2/engine/train_loop.py", line 132, in train
self.run_step()
File "/data/yang/detectron2/detectron2/engine/train_loop.py", line 215, in run_step
loss_dict = self.model(data)
File "/home/auv/anaconda3/envs/penguin/lib/python3.7/site-packages/torch/nn/modules/module.py", line 532, in __call__
result = self.forward(*input, **kwargs)
File "/home/auv/anaconda3/envs/penguin/lib/python3.7/site-packages/torch/nn/parallel/distributed.py", line 447, in forward
output = self.module(*inputs[0], **kwargs[0])
File "/home/auv/anaconda3/envs/penguin/lib/python3.7/site-packages/torch/nn/modules/module.py", line 532, in __call__
result = self.forward(*input, **kwargs)
File "/data/yang/detectron2/detectron2/modeling/meta_arch/rcnn.py", line 130, in forward
_, detector_losses = self.roi_heads(images, features, proposals, gt_instances)\
File "/home/auv/anaconda3/envs/penguin/lib/python3.7/site-packages/torch/nn/modules/module.py", line 532, in __call__\
result = self.forward(*input, **kwargs)\
File "/data/yang/detectron2/detectron2/modeling/roi_heads/roi_heads.py", line 582, in forward\
losses = self._forward_box(features, proposals)\
File "/data/yang/detectron2/detectron2/modeling/roi_heads/roi_heads.py", line 643, in _forward_box
box_features = self.box_pooler(features, [x.proposal_boxes for x in proposals])
File "/home/auv/anaconda3/envs/penguin/lib/python3.7/site-packages/torch/nn/modules/module.py", line 532, in __call__
result = self.forward(*input, **kwargs)
File "/data/yang/detectron2/detectron2/modeling/poolers.py", line 233, in forward
output[inds] = pooler(x_level, pooler_fmt_boxes_level)
File "/home/auv/anaconda3/envs/penguin/lib/python3.7/site-packages/torch/nn/modules/module.py", line 532, in __call__
result = self.forward(*input, **kwargs)
File "/data/yang/detectron2/detectron2/layers/roi_align.py", line 95, in forward
input, rois, self.output_size, self.spatial_scale, self.sampling_ratio, self.aligned
File "/data/yang/detectron2/detectron2/layers/roi_align.py", line 20, in forward
input, roi, spatial_scale, output_size[0], output_size[1], sampling_ratio, aligned
RuntimeError: CUDA error: no kernel image is available for execution on the device (ROIAlign_forward_cuda at /data/yang/detectron2/detectron2/layers/csrc/ROIAlign/ROIAlign_cuda.cu:364)
Expected behavior
I have a model which can be trained on another machine with 2 RTX 2080 cards correctly, so I am expecting the same on this machine with GTX1070 + RTX2080 installed
I have read other similar issues, but don't see what I did wrong. Can you shed some light on this? Thanks!
REPRODUCTION OF THE ISSUE
Modification
Environment
Installation
python -m pip install -e .
Running output
Expected behavior
I have a model which can be trained on another machine with 2 RTX 2080 cards correctly, so I am expecting the same on this machine with GTX1070 + RTX2080 installed I have read other similar issues, but don't see what I did wrong. Can you shed some light on this? Thanks!