-
Could anyone tell me where I can download the VQA-CP v2 dataset? The link in the README file is dead.
Thanks!
-
This is really great work! I ran it on the RAD-VQA dataset without pre-training and obtained lower accuracy rates: 39.1% for open questions, 65.4% for closed questions, and 54.9% overall. I would like…
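For anyone comparing numbers like these, here is a minimal sketch of how open, closed, and overall accuracy could be computed from per-question predictions. It assumes exact string match after lowercasing, which may differ from the repository's own evaluation, and the field names `answer_type`, `prediction`, and `answer` are hypothetical.

```python
def accuracy_by_type(examples):
    """Return accuracy per answer type plus a count-weighted overall accuracy.

    Assumption: a prediction counts as correct on a case-insensitive exact match.
    """
    if not examples:
        raise ValueError("examples must not be empty")
    totals, correct = {}, {}
    for ex in examples:
        kind = ex["answer_type"]  # hypothetical field, e.g. "open" or "closed"
        hit = ex["prediction"].strip().lower() == ex["answer"].strip().lower()
        totals[kind] = totals.get(kind, 0) + 1
        correct[kind] = correct.get(kind, 0) + int(hit)
    report = {kind: correct[kind] / totals[kind] for kind in totals}
    # Overall accuracy is weighted by question counts, not a mean of the two rates.
    report["overall"] = sum(correct.values()) / sum(totals.values())
    return report


# Toy data for illustration only; these are not results from any model.
examples = [
    {"answer_type": "closed", "prediction": "yes", "answer": "Yes"},
    {"answer_type": "closed", "prediction": "no", "answer": "no"},
    {"answer_type": "closed", "prediction": "no", "answer": "yes"},
    {"answer_type": "open", "prediction": "left lung", "answer": "right lung"},
]
report = accuracy_by_type(examples)
print(report)
```

Because the overall figure is weighted by how many questions fall in each type, it generally sits between the open and closed rates rather than at their simple average.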
-
Hi, I'm back again.
This time I have a new question.
I want to fine-tune the VQA checkpoints on my own data. Two things confuse me.
The first is which init checkpoints should…
-
Hello, this is very nice work. Are there any scripts available for the model quantization described in the paper?
-
See #44 for more details about the dataset and network architecture.
The task is:
1. Train a baseline VQA model with decent accuracy. Try using tools from MONAI as much as possible. (related to …
-
Hi @jmwang0117,
Thank you for sharing your work on HE-Drive; it’s very impressive! I noticed that the code repository currently doesn’t include the VLM-guided scoring section, specifically the comp…
-
Hello, thanks for your excellent work!
I'm currently running VQA_RAD and PATH-VQA.
Although the metrics have finished being calculated, the progress bar fails to update.
Could you please advise on how t…
-
FileNotFoundError: [Errno 2] Unable to open file (unable to open file: name = './data/PEIR\text_features.h5', errno = 2, error message = 'No such file or directory', flags = 0, o_flags = 0)
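The path in the error mixes `/` and `\` separators (`./data/PEIR\text_features.h5`), which suggests it was assembled by string concatenation. That may or may not be the cause; the file may simply not have been generated yet. As a minimal sketch, assuming the features are expected under a `data/PEIR` directory, the path could be built with `pathlib` and checked before opening. The helper name `features_path` is hypothetical.

```python
from pathlib import Path


def features_path(data_root, dataset, filename="text_features.h5"):
    """Join path components with pathlib instead of hard-coded separators."""
    return Path(data_root) / dataset / filename


path = features_path("./data", "PEIR")
if not path.is_file():
    # Print the absolute location so it is clear where the file is expected.
    print(f"Missing feature file: {path.resolve()}")
```

If the check reports the file as missing, the feature extraction step for that dataset probably has to be run first.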
-
When I try to execute the code below:
```
from donut import DonutModel
import torch
from PIL import Image
pretrained_model = DonutModel.from_pretrained("naver-clova-ix/donut-base")
if tor…
```
-
To use EgoVLPv2 (specifically EgoTaskQA) on a custom VQA dataset,
it is essential to preprocess the metadata about the questions and answers.
Specifically, the "answer_encode" is needed.
How can I encod…
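One common way to produce such a value is to map each distinct answer string to an integer class index. The sketch below assumes that "answer_encode" is meant to be that kind of index, which is not confirmed by the EgoVLPv2 code. The record fields `question` and `answer` and both helper names are hypothetical.

```python
from collections import Counter


def build_answer_vocab(answers, min_count=1):
    """Map each normalized answer string to an integer index (sorted for stability)."""
    counts = Counter(a.strip().lower() for a in answers)
    kept = sorted(a for a, c in counts.items() if c >= min_count)
    return {answer: index for index, answer in enumerate(kept)}


def encode_answers(records, vocab, unknown_index=-1):
    """Add an "answer_encode" entry to every record; unseen answers get unknown_index."""
    for record in records:
        key = record["answer"].strip().lower()
        record["answer_encode"] = vocab.get(key, unknown_index)
    return records


# Toy metadata for illustration only.
records = [
    {"question": "What is the person holding?", "answer": "cup"},
    {"question": "Is the door open?", "answer": "yes"},
    {"question": "Is the light on?", "answer": "Yes"},
]
vocab = build_answer_vocab(r["answer"] for r in records)
encoded = encode_answers(records, vocab)
print(vocab)
print([r["answer_encode"] for r in encoded])
```

If the pretrained checkpoint ships with its own answer vocabulary, the indices would need to follow that vocabulary rather than one rebuilt from the custom data.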