open-mmlab / mmdetection

OpenMMLab Detection Toolbox and Benchmark
https://mmdetection.readthedocs.io
Apache License 2.0
29.21k stars 9.4k forks source link

question about the open vocabulary performance of MM Grounding DINO on the COCO dataset #11783

Closed ymzis69 closed 3 months ago

ymzis69 commented 3 months ago

Thank you for implementing the Grounding DINO model in MMDetection. I currently have a question regarding the open vocabulary performance of MM Grounding DINO on the COCO dataset and hope to get your response. At present, I am researching open vocabulary performance. In the paper "Towards Open Vocabulary Learning: A Survey" (https://arxiv.org/abs/2306.15880), other models achieve a maximum APall of only 61.0 on the COCO dataset when the IoU threshold is 0.5. However, MM Grounding DINO achieves 73.6 for this metric on the COCO dataset. Could you please confirm if these are the same test metrics? If they are the same, does this mean that the open vocabulary performance of MM Grounding DINO significantly surpasses other methods? I look forward to your reply. 1718099686019 1718100030421