IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
https://arxiv.org/abs/2303.05499
Apache License 2.0

Try OV-DINO, a more powerful open-vocabulary detector. #350

Open wanghao9610 opened 1 month ago

wanghao9610 commented 1 month ago

Thanks for the awesome Grounding DINO. I'd like to share our recent work, 🦖 OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.

We have released the evaluation, fine-tuning, and demo code in our project. Feel free to try our model in your application.

Project: https://wanghao9610.github.io/OV-DINO

Paper: https://arxiv.org/abs/2407.07844

Code: https://github.com/wanghao9610/OV-DINO

Demo: http://47.115.200.157:7860/

Everyone is welcome to try our model. Feel free to open an issue if you encounter any problems.

HaisongDing commented 1 month ago

With the same Grounding DINO network, MM-Grounding-DINO's implementation already achieves 50.4 mAP zero-shot on COCO. See: https://github.com/open-mmlab/mmdetection/blob/dev-3.x/configs/mm_grounding_dino/README.md