Closed SteveJMao closed 11 months ago
Hello, I'm co-author Junhyung Lee. Sorry for the late reply. As you mentioned, when a total of 3 frames is used, including the target frame (denoted as t), the training method for the first and second frames of each scene in the nuScenes dataset is as follows.
Any additional questions are welcome!
Thank you for your reply! Now I understand.
Hello, and thank you for your great work! I have a question about the processing pipeline for long-term BEV aggregation. I noticed that you use 3 past frames for long-term BEV temporal alignment, and I wondered how you handle the first two frames of a scene, which do not have 3 previous frames. Do you simply skip the first two frames, or do you apply a different operation? Looking forward to hearing from you soon!