This repository is the implementation of "RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering".
This repository is based on and inspired by @microsoft's work . We sincerely thank for their sharing of the codes.
Download pytorch bert-base-uncased model from huggingface, then extract it into #root/source, where #root is the code root folder.
The directory structure is as follows:
pip3 install -r requiresments.txt
cd #root
python main_test.py
The result of ST-VQA task3 test will be saved in conf~/model/submission.json