Closed by gabrielhuang 3 years ago
Thanks for the question. I used Mask R-CNN as the object detection method when reporting the results in Table 4.
I think that Cascade Mask R-CNN was used in Table 2(b) of the Swin paper. You may want to take a look at the numbers reported in the following link, where the results match.
Hi all,
In Table 4 of the arXiv preprint https://arxiv.org/pdf/2106.09785.pdf, the reported AP^bb for Supervised is 46.0. Why is this number lower than the ones reported in the Swin paper?
Also, which object detection method are you using: Mask R-CNN or Cascade Mask R-CNN? The paper does not mention the detection method used.
Thanks!