Open Vicent0205 opened 1 month ago
Georgefwt
Thank you!
Additionally, in this dataset, 1437717772.jpg seems to be corrupted and needs to be downloaded again:
wget http://ecx.images-amazon.com/images/I/51YTH4k3fUL.jpg
cp 51YTH4k3fUL.jpg playground/data/ocr_vqa/images/1437717772.jpg
Thank you
thank you immensely helpful
Question
When I conduct finetuning using the mix665k.json file. I find that there are some images for ocr vqa do not exist! I find that there are 80,000 ocr_vqa data in mix665k file, while images of 355 data does not exist using the given download script.