ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Thank you for your great work!
I noticed that the two files data/wiki/bigram_pmi_cache.txt.gz and data/wiki/2gram.txt.gz are used in the metaclip/build_metadata.py, but where to download?
Thank you for your great work! I noticed that the two files data/wiki/bigram_pmi_cache.txt.gz and data/wiki/2gram.txt.gz are used in the metaclip/build_metadata.py, but where to download?