I have a question about the implementation of P3AFormer on KITTI. As shown in your paper, Swin-B is used as the backbone. How did you handle the resolution problem that the shape of the images in KITTI are 1280 * 384, but the pre-trained models of Swin-Transformer are all with a square resolution.
Thanks!
Hi again,
Could you please upload the code on KITTI dataset?
Hi,
Thanks a lot for your amazing work.
I have a question about the implementation of P3AFormer on KITTI. As shown in your paper, Swin-B is used as the backbone. How did you handle the resolution problem that the shape of the images in KITTI are 1280 * 384, but the pre-trained models of Swin-Transformer are all with a square resolution.
Thanks!
Hi again,
Could you please upload the code on KITTI dataset?
Thanks!