XuyangBai / TransFusion

[PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". https://arxiv.org/abs/2203.11496
Apache License 2.0
619 stars 76 forks source link

question about the version of Waymo Datase #53

Closed seonhoon1002 closed 2 years ago

seonhoon1002 commented 2 years ago

Thank you for sharing your great work.

I have a question about Waymo Dataset

Table 2 in your paper, It was measured by Waymo dataset. But I wonder what is the version you used.

Summary: what did you use version of Waymo?

XuyangBai commented 2 years ago

Hi, I use v1.2

seonhoon1002 commented 2 years ago

There are some weird points, because performances of PVRCNN and LiDAR RCNN in table 2 are too low.

Bellow are performance measured by v1.2 in SST[1] and PV-RCNN++[2] Vehicle(reported in SST) image Pedestrian(reported in SST) image (reported in PV-RCNN++) image

reference [1] Lue Fan, Ziqi Pang, Tianyuan Zhang, Yu-Xiong Wang, Hang Zhao, Feng Wang, Naiyan Wang, Zhaoxiang Zhang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 8458-8468 [2] Shi, Shaoshuai, et al. "PV-RCNN++: Point-voxel feature set abstraction with local vector representation for 3D object detection." arXiv preprint arXiv:2102.00463 (2021)

XuyangBai commented 2 years ago

The results of PVRCNN are borrowed from CenterPoint, and the results of LiDAR-RCNN are taken from the LiDAR-RCNN paper

截屏2022-08-21 12 14 09 截屏2022-08-21 12 12 42

The table here is the results for multi-class detection while the ones you provide is one model trained on one specific class (I assume) ?

seonhoon1002 commented 2 years ago

Q1: The table here is the results for multi-class detection while the ones you provide is one model trained on one specific class (I assume) ?

A1: Vehcle in SST, yes you're right(but pedestrain trained on multi-class), I shouldn't bring vehicle result in SST. I think LiDAR-RCNN result is reasonable.

But in PV-RCNN case, I think there is a problem in Centerpoint. Because they report Waymo V1.0 result not V1.2(V1.0 is lower than v1.2). Please refer bellow table(refer from PV-RCNN original paper[1])

image

I think It is easily confused if not informed version of Waymo in paper.

I think you should edit pv-rcnn result if you can.

reference [1] Shaoshuai Shi, Chaoxu Guo, Li Jiang, Zhe Wang, Jianping Shi, Xiaogang Wang, and Hongsheng Li. PV-RCNN: Point-voxel feature set abstraction for 3D object detection. CVPR, 2020. 1

XuyangBai commented 2 years ago

Oh thanks for pointing out this information which I did not notice before, I will edit this result if I plan to have a new version.