Hi, thank you for the wonderful work. I wonder how to train on images with 384×384 resolutions. As far as I know, openai has not released CLIP model with 384 resolution. CLIP from timm, however, does not contain the corresponding text encoder. Is there any other released pretrained CLIP weights?
Hi, thank you for the wonderful work. I wonder how to train on images with 384×384 resolutions. As far as I know, openai has not released CLIP model with 384 resolution. CLIP from timm, however, does not contain the corresponding text encoder. Is there any other released pretrained CLIP weights?