filaPro / oneformer3d

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation
Other
340 stars 32 forks source link

Some questions about Oneformer3D #31

Closed RayYoh closed 8 months ago

RayYoh commented 8 months ago

Hello authors, thanks for your great efforts, and congratulations for this work has been accepted by CVPR2024.

After reading the code, I have two questions about the implementations.

Looking for your reply.

Best, Ray

oneformer3d-contributor commented 8 months ago

Hi @RayYoh ,

  1. We use superpoints on ScanNet and do not use on S3DIS. The main reason here is that gt annotations on ScanNet are given for the same set of superpoints, so using them boosts segmentation metrics.
  2. The only reason is that at some point starting with Mask3D chekpoint on ScanNet200 brings slightly better metrics, compared to SSTNet checkpoint.
RayYoh commented 8 months ago

Hi @RayYoh ,

  1. We use superpoints on ScanNet and do not use on S3DIS. The main reason here is that gt annotations on ScanNet are given for the same set of superpoints, so using them boosts segmentation metrics.
  2. The only reason is that at some point starting with Mask3D chekpoint on ScanNet200 brings slightly better metrics, compared to SSTNet checkpoint.

Hi, oneformer3d contributors @oneformer3d-contributor. Thanks for your kind reply. I have understood Q2. But for Q1, during my training, I have changed the supervised signal to per point instance mask instead of superpoint mask in Oneformer3D (here), which means I didn't use scatter_mean and I remap the predicted voxel masks into point masks.

filaPro commented 8 months ago

I think we didn't try our method on ScanNet without superpoints. Only on S3DIS. And I'm not sure if training w/o superpoints can improve metrics on ScanNet...