wenyuqing / panacea

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
https://panacea-ad.github.io/
Apache License 2.0
197 stars, 12 forks

An issue about "nuscenes2d_ego_temporal_infos_train.pkl" #28

fayewong666999 closed 2 weeks ago

fayewong666999 commented 1 month ago

Great work! I have a question. This work uses the file "nuscenes2d_ego_temporal_infos_train.pkl", and I noticed that its "gt_boxes" differ from the gt_boxes in "nuscenes2d_temporal_infos_train.pkl". Is that because the two files store the boxes in different coordinate systems? Or does representing gt_boxes in the ego coordinate system offer some advantage? Thank you!

wenyuqing commented 2 weeks ago

Sorry for the late reply! The nuscenes2d_ego_temporal_infos_train.pkl file stores annotations in the ego coordinate system, while nuscenes2d_temporal_infos_train.pkl uses the lidar coordinate system. The two systems differ by a rigid transformation (a rotation plus a translation). We use nuscenes2d_ego_temporal_infos_train.pkl because it also includes lane lines, so the lane lines and the ground-truth boxes are both in the ego system and the same transformation projects them into the camera coordinate system. Since generation is done in image space, it doesn’t matter which coordinate system the training annotations are in: everything is ultimately projected into, and generated in, image space.
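To illustrate the point about coordinate systems, here is a minimal NumPy sketch (not the repository's code). It assumes made-up lidar-to-ego and ego-to-camera extrinsics and a made-up intrinsic matrix `K`. The helper names `make_transform`, `apply_transform`, and `project_to_image` are hypothetical. It shows that box centers have different values in the lidar and ego frames, yet land on the same pixels once each is projected with the matching transformation.

```python
import numpy as np

def make_transform(rotation, translation):
    """Build a 4x4 homogeneous transform from a 3x3 rotation and a 3-vector."""
    T = np.eye(4)
    T[:3, :3] = rotation
    T[:3, 3] = translation
    return T

def apply_transform(T, points):
    """Apply a 4x4 transform to an (N, 3) array of points."""
    homo = np.hstack([points, np.ones((points.shape[0], 1))])
    return (homo @ T.T)[:, :3]

def project_to_image(K, points_cam):
    """Pinhole projection of (N, 3) camera-frame points to (N, 2) pixels."""
    uvw = points_cam @ K.T
    return uvw[:, :2] / uvw[:, 2:3]

# --- All numbers below are illustrative assumptions, not real calibration ---
yaw = 0.02  # small lidar-to-ego yaw offset, in radians
R_lidar2ego = np.array([
    [np.cos(yaw), -np.sin(yaw), 0.0],
    [np.sin(yaw),  np.cos(yaw), 0.0],
    [0.0,          0.0,         1.0],
])
T_lidar2ego = make_transform(R_lidar2ego, np.array([0.94, 0.0, 1.84]))

# Ego frame: x forward, y left, z up. Camera frame: x right, y down, z forward.
R_ego2cam = np.array([
    [0.0, -1.0,  0.0],
    [0.0,  0.0, -1.0],
    [1.0,  0.0,  0.0],
])
cam_position_in_ego = np.array([1.5, 0.0, 1.5])
T_ego2cam = make_transform(R_ego2cam, -R_ego2cam @ cam_position_in_ego)

# Composing the two gives the direct lidar-to-camera transform.
T_lidar2cam = T_ego2cam @ T_lidar2ego

K = np.array([
    [1266.0,    0.0, 816.0],
    [   0.0, 1266.0, 491.0],
    [   0.0,    0.0,   1.0],
])

# Two box centers expressed in the lidar frame (stand-ins for gt_boxes[:, :3]).
centers_lidar = np.array([[10.0, 2.0, -1.0], [20.0, -3.0, -0.5]])
centers_ego = apply_transform(T_lidar2ego, centers_lidar)

# Project each representation with the transform that matches its frame.
uv_from_lidar = project_to_image(K, apply_transform(T_lidar2cam, centers_lidar))
uv_from_ego = project_to_image(K, apply_transform(T_ego2cam, centers_ego))

print("centers differ between frames:", not np.allclose(centers_lidar, centers_ego))
print("pixel coordinates agree:", np.allclose(uv_from_lidar, uv_from_ego))
```

The sketch only transforms box centers. A full box would also need its yaw rotated by the same transformation before its corners are projected.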