Question about the open-vocabulary performance of MM Grounding DINO on the COCO dataset

Thank you for implementing the Grounding DINO model in MMDetection. I am currently researching open-vocabulary detection and have a question about the open-vocabulary performance of MM Grounding DINO on the COCO dataset. In the paper "Towards Open Vocabulary Learning: A Survey" (https://arxiv.org/abs/2306.15880), the best AP_all reported for other models on COCO at an IoU threshold of 0.5 is only 61.0. MM Grounding DINO, however, achieves 73.6 on this metric on COCO. Could you please confirm whether these two numbers are measured with the same test metric? If they are, does this mean that the open-vocabulary performance of MM Grounding DINO significantly surpasses that of the other methods? I look forward to your reply.

open-mmlab / mmdetection

Question about the open-vocabulary performance of MM Grounding DINO on the COCO dataset #11783