How to give mmgrounding dino few-shot image examples like owl-vit?

open-mmlab / mmdetection

OpenMMLab Detection Toolbox and Benchmark

https://mmdetection.readthedocs.io

Apache License 2.0

28.5k stars 9.28k forks source link

How to give mmgrounding dino few-shot image examples like owl-vit? #11726

Open zappy586 opened 1 month ago

zappy586 commented 1 month ago

Mmgrounding seems really promising for few-shot object detection. But the early modality fusion makes the architecture very confusing. Has anyone tried to convert this model into a few-shot learner or has any ideas on how to do it?