Based on my observation, the bbox of a certain target in Refer-kitti does not match the bbox of the same target in KITTI, and there is a single digit error between the two. May I ask if this is due to your correction of the bbox or my misunderstanding of the KITTI dataset?
Based on my observation, the bbox of a certain target in Refer-kitti does not match the bbox of the same target in KITTI, and there is a single digit error between the two. May I ask if this is due to your correction of the bbox or my misunderstanding of the KITTI dataset?
For example, Video 0 Obj 2 in KITTI:
0 2 Pedestrian 0 0 -2.523309 1106.137292 166.576807 1204.470628 323.876144 ...
and same obj in Refer-KITTI-2:
0,0,1105.999755859375,175.9998779296875,92.99971771240234,142.9998779296875,...