Open shankyemcee opened 2 years ago
Hi, you can modify VQA.yaml to use your own training annotation.
The problem is with the json files. The keys like ann['answer'] and ann['image'] cant be found. I am using OpenEnded_abstract_v002_train2017_questions.json as training data. Is there some other dataset that was used for VQA fine tuning other than this one? Many things are hardcoded and for now I just want to reproduce the results reported in the paper, so if you would kindly point me to the direction of the dataset, it would be appreciated.
Hi, you need to create your own json file from OpenEnded_abstract_v002_train2017_questions.json
Got it. Thanks
Hi,
I am trying to run the VQA.py file for fine-tuning to the vqa v2 dataset. I am first trying on just the binary balanced abstract scenes due to their smaller size (https://visualqa.org/download.html). I have set the paths in the VQA.yaml file as mentioned. But when I go through the code in the vqa_dataset.py file, the keys being used dont match the keys in the dataset json files. For example here is a snippet of the questions json file:
{ "info": { "description": "This is Balanced Binary Abstract Scenes VQA dataset.", "url": "http://visualqa.org", "version": "1.0", "year": "2017", "contributor": "VQA Team", "date_created": "2017-03-09 14:27:27" }, "task_type": "Open-Ended", "data_type": "abstract_v002", "license": { "url": "http://creativecommons.org/licenses/by/4.0/", "name": "Creative Commons Attribution 4.0 International License" }, "data_subtype": "val2017", "questions": [ { "image_id": 28940, "question": "Is it daylight?", "question_id": 289402 }, { "image_id": 900289402, "question": "Is it daylight?", "question_id": 900289402 },
and here is a snippet of the code for processing data:
Can I know which dataset to use if this is not the one? I just want to try fine tuning the model on a small dataset to get it working so I can train it on another dataset. Thanks.