traveller59 / second.pytorch

SECOND for KITTI/NuScenes object detection
MIT License
1.72k stars 722 forks source link

Attempt to submit for ScanNet? #81

Open NicksonYap opened 5 years ago

NicksonYap commented 5 years ago

Hi Yan Yan, @traveller59

Awesome repo! (the Docker image really helps)

Currently SparseConvNet tops the Scan-Net 3D Semantic label benchmark http://kaldir.vc.in.tum.de/scannet_benchmark/ code: https://github.com/facebookresearch/SparseConvNet/tree/master/examples/ScanNet

Since SpConv is an improvement of SparseConvNet, could probably try to top it :P

SECOND is more suitable for the Scan-Net 3D Semantic instance benchmark The current winner (3D-SIS) uses multiple 2D RGB features + 3D to obtain semantic instance

Any intentions to extend SECOND to run ScanNet data? (contains ply rgb pointclouds and images)

Regards

traveller59 commented 5 years ago

There are some works and limits to do if I provide support for semantic Instance (mask rcnn):

  1. add point roi pooling and simple rcnn architecture (to do)
  2. unlike fixed-size detection in LiDAR data, the multi-scale detection in scannet need multi-scale architectures such as FPN. I'm not familiar with them. the SECOND is a simple single-stage fixed-size detector.
  3. SECOND is lidar-only detector. the image isn't used. only concatenate rgb features to point cloud is supported.
  4. I don't have access to scannet data. web-based visualization tools are also needed.

In summary, the major problem is multi-scale RCNN architecture and data access/processing/visualization. It's not easy, I have no plan to develop the scannet data process/visualization, but a simple rcnn will be added in future.

NicksonYap commented 5 years ago

@traveller59

I've limited experience with this but I believe ScanNet data is similar of that to KITTI such that:

ScanNet can be downloaded by sending an agreement PDF to an email (It took me a day for a reply) https://github.com/ScanNet/ScanNet#scannet-data You'll be given a .py file to select what to download (it's a file called download-scannet.py :wink: )

By simple RCNN do u mean combining 2D Image data with the point clouds? If we're able to do that then I believe most of the work is already done :sweat_smile: