Closed Caizifen closed 1 year ago
How to download ocr_vqa? what is pdb?
Apologies for the confusion. I just re-calculated the exact samples in the dataset mixture, and we will update the paper to correct the sample count for RefCOCO and A-OKVQA. Note that the released dataset is correct, only the number reported in the table is off for these two datasets.
Dataset | Actual | Paper |
---|---|---|
LLaVA | 157712 | 158K |
SG40k | 40688 | 40K |
VQA-v2 | 82783 | 83K |
GQA | 72140 | 72K |
OKVQA | 8998 | 9K |
OCRVQA | 80000 | 80K |
A-OKVQA | 66160 | |
TextCaps | 21953 | 22K |
RefCOCO | 48447 | |
VG | 86417 | 86K |
Total | 665298 | 665K |
@Caizifen Hello, I'm also confused about the difference between README and the table mentioned by @haotian-liu. Have you clarified it? And does the data mentioned in the README have contained all the data in the table?
How to download ocr_vqa? what is pdb?
Hi, I want to know if you have solved this problem? i have encountered the same problem
Question
The above is the structure of the fine-tuning dataset provided. After I downloaded the data according to the README, the total number is not 665k, only 608k. Did I miss anything?