Open Harry-zzh opened 1 year ago
Thanks for your questions. The Grounding DINO is a grounding model, which means it detects objects from images and corresponding phrases from sentences. That may be caused by the confidence of the "plant" is not high. We suggest decrease the text_threshold
in scripts.
Hi, thank you for your excellent work.
When I run the Grounded-Segment-Anything demo, the text prompt that I use is "pottedplant"; however, the text label that appears on the resulting visualization image is "potted".
I wonder why it happens. Thanks!