facebookresearch / MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

ImageNet V2 evaluation #18

Closed: samuelstevens closed this issue 10 months ago

samuelstevens commented 10 months ago

One of the proposed benefits of training on WIT-400M and LAION-400M is strong robustness under distribution shift. This is typically measured by comparing zero-shot accuracy on ImageNet with zero-shot accuracy on ImageNet V2, ImageNet-R, etc.

Did you evaluate the MetaCLIP models on distribution shifts of ImageNet? Even evaluating on ImageNet V2 alone would give a good idea of the models' robustness. Thanks!
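For readers unfamiliar with the comparison being asked about, here is a minimal sketch of the robustness measurement: compute top-1 accuracy on the in-distribution set and on the shifted set, then take the difference. The function names and the toy predictions are hypothetical and exist only for illustration; they are not MetaCLIP results or part of the MetaCLIP codebase.

```python
def top1_accuracy(predictions, labels):
    """Fraction of examples whose predicted class equals the true class."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def robustness_gap(in_dist_acc, shifted_acc):
    """Accuracy lost when moving from the original to the shifted test set."""
    return in_dist_acc - shifted_acc


# Hypothetical toy data (NOT real model outputs): class ids per image.
imagenet_preds, imagenet_labels = [0, 1, 2, 3], [0, 1, 2, 3]
v2_preds, v2_labels = [0, 1, 2, 0], [0, 1, 2, 3]

gap = robustness_gap(
    top1_accuracy(imagenet_preds, imagenet_labels),
    top1_accuracy(v2_preds, v2_labels),
)
print(f"accuracy drop under shift: {gap:.2f}")
```

A smaller gap at a given in-distribution accuracy suggests a more robust model.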

howardhsu commented 10 months ago

We report results averaged over ImageNet variants in Table 8 of the appendix. For ImageNet V2, MetaCLIP gets: L14-400M: 69.8%, L14-1B: 72.5%, L14-2.5B: 72.6% (vs. OpenAI CLIP L14-400M: 69.8%, OpenCLIP L14-400M: 65.4%).
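The ImageNet V2 figures quoted above can be compared directly. The short sketch below only restates those numbers and computes each model's margin over the OpenCLIP L14-400M baseline; the dictionary keys and variable names are illustrative labels, not identifiers from the MetaCLIP repository.

```python
# ImageNet V2 zero-shot top-1 accuracy (%), as quoted in the reply above.
imagenet_v2_top1 = {
    "MetaCLIP L14-400M": 69.8,
    "MetaCLIP L14-1B": 72.5,
    "MetaCLIP L14-2.5B": 72.6,
    "OpenAI CLIP L14-400M": 69.8,
    "OpenCLIP L14-400M": 65.4,
}

baseline = "OpenCLIP L14-400M"  # illustrative choice of reference model

# Margin over the baseline, in percentage points (rounded to one decimal).
deltas = {
    name: round(acc - imagenet_v2_top1[baseline], 1)
    for name, acc in imagenet_v2_top1.items()
    if name != baseline
}

for name, delta in deltas.items():
    print(f"{name}: {delta:+.1f} points vs {baseline}")
```

At the 400M scale, MetaCLIP matches OpenAI CLIP on ImageNet V2, and the larger data scales add a further 2.7 to 2.8 points.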

samuelstevens commented 10 months ago

Thanks!