openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
MIT License
24.55k stars 3.2k forks

Please add VIT-G support #419

Open ppbrown opened 7 months ago

ppbrown commented 7 months ago

You have VIT-B, VIT-L... but the most interesting recent one is VIT-G. Could you please add support?