IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
https://arxiv.org/abs/2303.05499
Apache License 2.0

Difference between GroundingDINO and UniDetector. #25

Closed synsin0 closed 1 year ago

synsin0 commented 1 year ago

I am new to open-set object detection and am trying to understand the difference between Grounding DINO and UniDetector. Both can perform open-set detection. My guess at the difference is this: with Grounding DINO, for each dataset you need to prompt the network with the novel labels for zero-shot detection, whereas UniDetector takes many prompts as input, not limited to the labels of each dataset. Could you also share more insight into the contributions of GroundingDINO? An answer would be very helpful to me. Thanks!

SlongLiu commented 1 year ago

Please see our paper for more details. Our model supports arbitrary text inputs, which means you can detect anything you can describe in text. Moreover, we use grounded pre-training, which enables the model to detect the noun phrases in an input sentence. UniDetector cannot do this.
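
To make the "any text input" point concrete, here is a minimal sketch (not the authors' code) of the two kinds of prompts discussed in this thread: a label list turned into a single prompt, and a free-form sentence. It assumes the period-separated, lowercase prompt convention described in the project's README. The helper names `build_caption` and `split_caption` are hypothetical and are not part of the GroundingDINO package.

```python
from typing import List


def build_caption(labels: List[str]) -> str:
    """Join category names into one lowercase, period-separated text prompt.

    Hypothetical helper; assumes categories are separated by " . ".
    """
    cleaned = [label.strip().lower() for label in labels if label.strip()]
    return " . ".join(cleaned) + " ."


def split_caption(caption: str) -> List[str]:
    """Recover the individual phrases from a period-separated prompt."""
    return [part.strip() for part in caption.split(".") if part.strip()]


# Dataset-style usage: a label set (made-up labels) becomes one prompt.
label_prompt = build_caption(["Cat", "traffic light", " dog "])

# Free-form usage: a whole sentence is the prompt, and the noun phrases
# inside it ("the man", "a red umbrella") are what grounding targets.
sentence_prompt = "the man holding a red umbrella ."

print(label_prompt)
print(split_caption(label_prompt))
```

The label-list case corresponds to the per-dataset zero-shot setting mentioned in the question; the sentence case is the grounding setting, where the text is not restricted to any label vocabulary.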

synsin0 commented 1 year ago

I understand GroundingDINO's ability to ground phrases in context, especially in REC (referring expression comprehension) tasks. Thanks for your great work and timely response.