HI,
Thank you for your open source work. I am also very interested in your work. Regarding your work, I have a question: When you use Covnext as the image encoder to pre-train CLIP, the weights of convnext use ImageNet pre-trained initialization or random initialization?
HI, Thank you for your open source work. I am also very interested in your work. Regarding your work, I have a question: When you use Covnext as the image encoder to pre-train CLIP, the weights of convnext use ImageNet pre-trained initialization or random initialization?