Here is the command I used and the results when using the pre-trained model on RefCOCO for REC recognition. However, the recognition for left and right directions is very poor.
python demo/image_demo.py test_images/two_human/ /home/wh/sjx/yuanban_mm/mmdetection/configs/grounding_dino/grounding_dino_swin-b_finetune_16xb2_1x_coco.py --weights /home/wh/sjx/yuanban_mm/mmdetection/weights/grounding_dino_swin-b_finetune_16xb2_1x_coco_20230921_153201-f219e0c0.pth --texts 'the left person.' --device 'cpu' --tokens-positive -1
Here is the command I used and the results when using the pre-trained model on RefCOCO for REC recognition. However, the recognition for left and right directions is very poor.
python demo/image_demo.py test_images/two_human/ /home/wh/sjx/yuanban_mm/mmdetection/configs/grounding_dino/grounding_dino_swin-b_finetune_16xb2_1x_coco.py --weights /home/wh/sjx/yuanban_mm/mmdetection/weights/grounding_dino_swin-b_finetune_16xb2_1x_coco_20230921_153201-f219e0c0.pth --texts 'the left person.' --device 'cpu' --tokens-positive -1