Closed JosephPai closed 3 months ago
Hi authors,
Thanks for releasing the code. I noticed that you mentioned "Note that our image encoder is the same as OpenCLIP. Not as fine-tuned as other modalities." I would like to know what is the exact version of CLIP weight are you using?
Thanks!
For CLIP-L: https://huggingface.co/laion/CLIP-ViT-L-14-DataComp.XL-s13B-b90K
For CLIP-H: https://huggingface.co/laion/CLIP-ViT-H-14-laion2B-s32B-b79K
Hi authors,
Thanks for releasing the code. I noticed that you mentioned "Note that our image encoder is the same as OpenCLIP. Not as fine-tuned as other modalities." I would like to know what is the exact version of CLIP weight are you using?
Thanks!