siyuanliii / masa

Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
https://matchinganything.github.io
Apache License 2.0
940 stars 60 forks

How to use Detic instead of GroundingDINO as the detector? #23

Open yashbhalgat opened 2 months ago

yashbhalgat commented 2 months ago

Hi @siyuanliii, I want to perform open-vocabulary detection and tracking using Detic. However, I am not sure how to change the detector from GroundingDINO to Detic. Can you provide some instructions on how to do it?

I downloaded the Detic model from here to saved_models/masa_models directory. But how do I create a config for Detic, etc.? Would appreciate your help with this!

Also, if I want to use OWLv2 as the open-vocabulary 2D model, how would I do that?

Thanks!

siyuanliii commented 1 month ago

Hi, thanks for your question, and sorry for the late reply! Yes, you can use any open-vocabulary detector. We have provided config files for the YOLOX detector as an example. In demo/video_demo_with_text.py, set --det_config and --det_checkpoint to Detic's config file and checkpoint. Similarly, if you want to use OWLv2, you have two options: 1) integrate it into mmdetection, then pass its config and checkpoint as above; 2) alternatively, modify video_demo_with_text.py to replace the detection results with your detections from OWLv2.
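
For the first route, here is a minimal sketch of how the demo invocation could be assembled. Only the script path and the --det_config / --det_checkpoint flags come from the reply above. The file names below are hypothetical placeholders, and any other arguments the script needs (input video, output path, text prompts, and so on) should be taken from the script's own help output and passed through extra_args.

```python
import shlex


def build_demo_command(det_config, det_checkpoint, extra_args=()):
    """Assemble the argv list for the text-prompted video demo.

    Only --det_config and --det_checkpoint are taken from the thread;
    everything in extra_args is passed through unchanged.
    """
    return [
        "python",
        "demo/video_demo_with_text.py",
        *extra_args,
        "--det_config", str(det_config),
        "--det_checkpoint", str(det_checkpoint),
    ]


# Hypothetical file names: substitute the Detic config and checkpoint you actually have.
cmd = build_demo_command(
    det_config="path/to/detic_config.py",
    det_checkpoint="saved_models/masa_models/detic_checkpoint.pth",
)

# Print a copy-pasteable shell command instead of running it.
print(shlex.join(cmd))
```

Printing the command first, rather than executing it, makes it easy to check the paths before launching a long run. Once it looks right, the same list can be handed to subprocess.run(cmd, check=True).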