Open yxchng opened 1 year ago
Thanks for the issue. The model output boxes with corresponding noun
in the sentence. In the example, the described words may be ignored.
Same issue for me. Using descriptions like object in hand
would get both object
and hand
results.
Despite this limitation, this work is great and easy-to-implement. Thanks to the authors' excellent work :)
I run the demo using the command
python grounded_sam_demo.py --config GroundingDINO/groundingdino/config/GroundingDINO_SwinT_OGC.py --grounded_checkpoint groundingdino_swint_ogc.pth --sam_checkpoint sam_vit_h_4b8939.pth --input_image assets/demo3.jpg --output_dir "outputs" --box_threshold 0.3 --text_threshold 0.25 --text_prompt "chair behind blue chair" --device "cpu"
However, the text on the output is
chair