Open hyeonmin11 opened 8 months ago
Hi,
Thank you for the question.
It depends:
It is worth noting that MeViS dataset provides instance-level mask annotations, but for each sentence, its corresponding Ground Truth mask is a binary mask whose foreground region is the target object(s).
instance segmentation model?