dvlab-research / FocalsConv

Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
https://arxiv.org/abs/2204.12463
Apache License 2.0
368 stars 35 forks source link

Data augmentation in Section 3.3. #4

Closed CBY-9527 closed 2 years ago

CBY-9527 commented 2 years ago

It's an interesting work. I have a question about the words in section 3.3 of the paper "For ground-truth sampling, we copy the corresponding 2D objects onto images. Rather than using an additional segmentation model or mask annotations [57], we directly crop objects in bounding boxes for simplification." However, copying other objects to the current scene will overlap in the RGB image. Copying paste objects from other scenes in point clouds is effective and overlapping objects can be removed through collision experiments. How to deal with overlapping objects in RGB images?

yukang2017 commented 2 years ago

Hi, thanks for your interests in our work.

It is true that this operation will cause overlap in the RGB image. We did not solve this problem and just leave it here, because it has little relation to our main motivation. And the results still perform well even if this issue remain.

Below is a illustration of one RGB training image with the augmented point cloud projected. image

I think there might be some other well-design method that is specific to this copy-pasting issue. This paper is a reference.