linyq2117 / CLIP-ES

MIT License

Set of referring image segmentation queries #14

Closed: SouthFlame closed this issue 9 months ago

SouthFlame commented 9 months ago

Thanks for your interesting work!!

I could not find the details of how the initial text queries for referring image segmentation are constructed.

If this is already described in the paper, I apologize for asking.

Best regards,

Namyup Kim.

linyq2117 commented 9 months ago

Hi, thanks for your interest.

I am not sure I fully understand your question. Our method is designed for weakly supervised semantic segmentation, and in this setting image-level labels (the class names present in each image) are provided. We only augment the class names with prompts and synonyms (Section 3.2) to form the text input of CLIP. Perhaps these are the initial text queries you mentioned?
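
As a rough illustration of what augmenting class names with prompts and synonyms could look like, here is a minimal sketch. It is not taken from the CLIP-ES code: the function name `build_text_queries`, the template `"a photo of a {}."`, and the synonym table are all hypothetical placeholders, and the actual prompts and synonyms are the ones described in Section 3.2 of the paper.

```python
from typing import Dict, List, Optional

# Hypothetical prompt template, chosen only for illustration.
PROMPT_TEMPLATE = "a photo of a {}."

# Hypothetical synonym table: maps a dataset class name to a phrase
# that may be easier for a text encoder to interpret.
SYNONYMS: Dict[str, str] = {
    "tvmonitor": "tv monitor",
    "pottedplant": "potted plant",
}


def build_text_queries(
    class_names: List[str],
    template: str = PROMPT_TEMPLATE,
    synonyms: Optional[Dict[str, str]] = None,
) -> List[str]:
    """Turn image-level class names into text queries for a text encoder.

    Each class name is first replaced by its synonym (if one is listed),
    then inserted into the prompt template.
    """
    table = SYNONYMS if synonyms is None else synonyms
    return [template.format(table.get(name, name)) for name in class_names]


# Image-level labels for one image (assumed example input).
queries = build_text_queries(["tvmonitor", "dog"])
print(queries)  # ['a photo of a tv monitor.', 'a photo of a dog.']
```

The resulting strings would then be tokenized and passed to the CLIP text encoder; that step is omitted here so the sketch stays independent of any particular CLIP package.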

SouthFlame commented 9 months ago

Thanks for the answer! I am sorry for asking a question based on a misunderstanding, but your answer cleared it up.