Hi, I am trying to evaluate Grounded SAM on COCO instance segmentation dataset. For that, I am giving a text prompt which is a sentence with all coco classes separated by a comma, and the Grounding dino gives bounding boxes and its corresponding class as a phrase, but sometimes these phases are not matching the given prompt, some times it doesn't even mean anything. Sometimes I am getting multiple classes for a single bounding box, which are not supercategories of each other. Like the phrases are stemmed words of a given text prompt, like fire hydrant is given as firehydr, sometimes it is outside of class like, I got '##hy' as a pred_phrase for a bounding box which is not in the text prompt. Is there any code given for mapping pred_phrases to its corresponding class given in the text prompt.
Hi, I am trying to evaluate Grounded SAM on COCO instance segmentation dataset. For that, I am giving a text prompt which is a sentence with all coco classes separated by a comma, and the Grounding dino gives bounding boxes and its corresponding class as a phrase, but sometimes these phases are not matching the given prompt, some times it doesn't even mean anything. Sometimes I am getting multiple classes for a single bounding box, which are not supercategories of each other. Like the phrases are stemmed words of a given text prompt, like fire hydrant is given as firehydr, sometimes it is outside of class like, I got '##hy' as a pred_phrase for a bounding box which is not in the text prompt. Is there any code given for mapping pred_phrases to its corresponding class given in the text prompt.