Open hhaAndroid opened 10 months ago
@hhaAndroid thank you very much for supporting Grounding DINO finetuning! I just have a few questions:
my goal is to maintain Grounding DINO's versatility in open-set detection but just try to add a few custom classes
Many thanks!
@hhaAndroid thank you very much for supporting Grounding DINO finetuning! I just have a few questions:
my goal is to maintain Grounding DINO's versatility in open-set detection but just try to add a few custom classes
- in this finetuning procedure from the MMDetection docs, it looks like we have to explicitly set the number of classes. does this mean the finetuned model can no longer do open-set detection? or am I misunderstanding something?
- will the finetuned model still be able to handle Referring Expression Comprehension (REC)? for example, can I still prompt the finetuned model with "the left lion"?
- could you please share any script or code snippets on how you achieved the finetuning?
Many thanks!
Maybe the text input of GroundingDINO in mmdet fixed categoly (not real text) 😥
If you encounter any issues while using it, please feel free to create an issue.
This is amazing, thank you!
Can those models be used with the base groundingdino implementation? the configs look quite different, so i guess not? Bummer to change the implementation at this point
Can I finetune grounding dino on a prompt? The thing is that there should be these objects in pretraining data, but I would like to add some additional information to get better predictions. Let's say I only want to detect "black cats". The problem is that I have few data samples, so I would like to tune it a little bit with prompt to use pretrained knowledge.
hi,What are the minimum equipment requirements of fine-tunning grounddino with coco dataset?(default batch-size=32)
Hi All: MMDetection supports Grounding-DINO inference and fine-tuning for now. The mAP we achieved in our reproduction is higher than the official results. We also provide the results of retraining the R50 model from scratch, which exhibits significantly higher performance than the official implementation.
Installation
NOTE
Grounding DINO utilizes BERT as the language model, which requires access to https://huggingface.co/. If you encounter connection errors due to network access, you can download the required files on a computer with internet access and save them locally. Finally, modify the
lang_model_name
field in the config to the local path. Please refer to the following code:Inference
Results and Models
Details for https://github.com/open-mmlab/mmdetection/blob/dev-3.x/configs/grounding_dino/README.md
And we also support GLIP inference and fine-tuning
If you encounter any issues while using it, please feel free to create an issue.