Closed LorenzoAgnolucci closed 1 year ago
Hi, thanks for publishing the code, I find your work very interesting!
I have one question: which CLIP model did you use to obtain the results reported in the paper? ViT-L/14 or ViT-L/14@336px?
Thanks in advance.
Thanks for your interest. We used ViT-L/14.
Thanks!
Hi, thanks for publishing the code, I find your work very interesting!
I have one question: which CLIP model did you use to obtain the results reported in the paper? ViT-L/14 or ViT-L/14@336px?
Thanks in advance.