IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
https://arxiv.org/abs/2401.14159
Apache License 2.0
14.88k stars, 1.38k forks

Fine-tune grounded dino #453

Open Knivacke opened 7 months ago

Knivacke commented 7 months ago

First of all - what an amazing framework, I'm blown away.

Is it possible to fine-tune the automatic labelling step of Grounded SAM, i.e. Grounding DINO? I'm using the model on newspaper images from Sweden and would, for example, need more consistent classification of Swedish police officers.

Thanks.

rentainhe commented 7 months ago

It's better to fine-tune Grounding DINO to get better localization in this pipeline.

Knivacke commented 7 months ago

Thanks for the response. Do you mean fine-tuning Grounding DINO separately, and then using my fine-tuned model in place of the base Grounding DINO in Grounded SAM?
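
For illustration, the swap discussed in this thread (a separately fine-tuned Grounding DINO checkpoint used in place of the base one, with SAM left unchanged) might look like the sketch below. It is not a confirmed recipe from the maintainers. It assumes the fine-tuned checkpoint keeps the same architecture and checkpoint format as the base model, so the same config file still applies. The file name `my_finetuned_groundingdino.pth` and the helper names `GroundedSamPaths` and `load_models` are hypothetical. The base paths are the ones commonly used with this repository and may differ in your setup.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class GroundedSamPaths:
    """Hypothetical container for the files the pipeline needs."""
    dino_config: str      # Grounding DINO model config (architecture definition)
    dino_checkpoint: str  # Grounding DINO weights (the part being swapped)
    sam_checkpoint: str   # SAM weights (unchanged by the fine-tuning)


# Base setup; adjust the paths to wherever the files live locally (assumption).
BASE = GroundedSamPaths(
    dino_config="GroundingDINO/groundingdino/config/GroundingDINO_SwinT_OGC.py",
    dino_checkpoint="groundingdino_swint_ogc.pth",
    sam_checkpoint="sam_vit_h_4b8939.pth",
)

# Only the detector weights change; the file name here is hypothetical.
FINE_TUNED = replace(BASE, dino_checkpoint="my_finetuned_groundingdino.pth")


def load_models(paths: GroundedSamPaths, device: str = "cpu"):
    """Load the detector and a SAM predictor from the given paths.

    Imports are done lazily so this module can be imported without the
    heavy dependencies installed.
    """
    from groundingdino.util.inference import load_model
    from segment_anything import SamPredictor, sam_model_registry

    detector = load_model(paths.dino_config, paths.dino_checkpoint, device=device)
    sam = sam_model_registry["vit_h"](checkpoint=paths.sam_checkpoint)
    sam.to(device)
    return detector, SamPredictor(sam)
```

Calling `load_models(FINE_TUNED)` instead of `load_models(BASE)` would then be the only change needed downstream, since the boxes produced by the detector are passed to SAM in the same way in both cases.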