Hi @ngthanhtin,
Please check out the fine-tuned CLIP models: ViT-B/16 and ViT-L/14. These models follow the exact same format as OpenAI's CLIP, so you can conduct an apples-to-apples comparison. You can easily use open_clip v1.3 to load the model weights.
self.clip_model, _, _ = open_clip.create_model_and_transforms('ViT-L-14', pretrained=ovsegclip_path)
Unfortunately, we don't have ViT-B/32 weights. You may want to use our open clip training to train your own CLIP ViT-B/32 with our mask-text pairs data.
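For reference, here is a minimal sketch of loading the fine-tuned ViT-L/14 weights with open_clip and scoring a masked image crop against text prompts. The checkpoint path, image path, and class prompts (`ovseg_clip_l14.pth`, `masked_crop.png`, the prompt strings) are placeholders for illustration, not names from the release:

```python
import torch
from PIL import Image
import open_clip

# Placeholder path to the downloaded fine-tuned ViT-L/14 checkpoint.
ovsegclip_path = "ovseg_clip_l14.pth"

# Same call as above; the third return value is the inference preprocessing transform.
model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-L-14", pretrained=ovsegclip_path
)
model.eval()

# Encode a masked image crop and a few candidate class prompts.
image = preprocess(Image.open("masked_crop.png")).unsqueeze(0)
text = open_clip.tokenize(["a photo of a dog", "a photo of a cat"])

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    # Cosine similarity between the crop and each prompt, as class probabilities.
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print(probs)
```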
Hi @all, thank you for your great work. Your work has two implications: 1) the segmentation model with mask prompting, and 2) the embedding from your new CLIP model (which takes a masked image and a mask prompt as inputs).
For now, I have only found the weights for the entire model (the segmentation model), so could you provide us with the new CLIP weights? I think they would be really helpful, because your work helps CLIP learn region-level image representations (as in RegionCLIP).
And do you plan to work on ViT-B/32? I have only found the ViT-B/16 and ViT-L/14 versions. Thank you very much.
Best, Tin