microsoft / Oscar

Oscar and VinVL
MIT License
1.04k stars 251 forks source link

Image idx and image file name matching error #155

Open xuehaoliu109 opened 2 years ago

xuehaoliu109 commented 2 years ago

Dear OSCAR team:

When I was recreating your BERT pre-train work, it seemed that the idx of images in coco_flickr30k_googlecc_gqa_sbu_oi.tsv does not match the file name in the image features.tsv file. This problem will cause the whole data preprocessing work to fail. Do you know how to fix this problem? Both the coco_flickr30k_googlecc_gqa_sbu_oi.tsv and features.tsv files are downloaded from the VinVL_DOWNLOAD.md file.

Looking forward to your reply.

Best regards, Xuehao Liu