Hi,thank you for your open source classification task.
As for the experiment of object detection task in the paper, I am very confused.
In the appendix, it only mentioned "using standard Cascade Mask R-CNN as the basic framework", then how did you add Vim to the framework , only replacing backbone from ResNet to Vim?
Please spare your precious time to give me some advice and guidance, thank you!
Hi,thank you for your open source classification task. As for the experiment of object detection task in the paper, I am very confused. In the appendix, it only mentioned "using standard Cascade Mask R-CNN as the basic framework", then how did you add Vim to the framework , only replacing backbone from ResNet to Vim? Please spare your precious time to give me some advice and guidance, thank you!