Use which feature to classify for demo.

facebookresearch / ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Other

676 stars 61 forks source link

Hi @synsin0,

Yes, we only use ov-seg (MaskFormer) to produce mask proposals, leaving its class predictions unused in demo. The reason is, like you also mentioned, the performance would become worse if we use it. We conjecture this is because the open-vocabulary classifier of ov-seg (MaskFormer) is trained with COCO-171, resulting it fitting to these 171 classes while being unable to handle the diverse cases in the demo.

If you only want to use ov-seg (MaskFormer) class prediction (The MaskFomer only results in Table 5), you may want to turn CLIP_ENSEMBLE to False as in here.

I close this issue, feel free to reopen it if you have further questions.

facebookresearch / ov-seg

Use which feature to classify for demo. #12