facebookresearch / mmbt

Supervised Multimodal Bitransformers for Classifying Images and Text
Other
243 stars 52 forks source link

Hello, can you provide a link to the dataset? for example,MM-IMDB,FOOD101,V-SNLI #8

Open 15779235038 opened 3 years ago

douwekiela commented 3 years ago

Food101: http://visiir.lip6.fr/data/public/UPMC_Food101.tar.gz (http://visiir.lip6.fr/) MMIMDB: http://lisi1.unal.edu.co/mmimdb/ V-SNLI: https://arxiv.org/pdf/1806.05645.pdf (you need the SNLI dataset from https://nlp.stanford.edu/projects/snli/ and the Flickr30k images from http://shannon.cs.illinois.edu/DenotationGraph)