-
Hey,
I am Zhiqiu Lin, a final-year PhD student at Carnegie Mellon University working with Prof. Deva Ramanan. Your work is very interesting with great performance gains!
I wanted to share [Natu…
-
Do we need to consider the tendency or prior to say "Yes" ?
Like P(Yes | text, image)/ P(Yes |null ,image) or P(Yes | text, image) - P(Yes |null ,image) , something like these?
-
I would like to conduct object detection task by utilizing a VQA model using autotrain API. I followed this [guide](https://huggingface.co/blog/abhishek/paligemma-finetuning-autotrain). Accordingly, I…
-
from vpa import VQA, but where is vqa? or How can I install vqa package?
-
Hi, when will the weights for blip-2 fine-tuned for VQA be released? We've been waiting for quite some time. I'd really appreciate it if they can be released as soon as possible. :) Thanks!
-
Thank you so much for your great work!
Are there some official scripts to process VQA labels into conversation format for the 3 VQA datasets? Or could you please provide the pre-processed json file…
-
Hello, thanks for your excellent work. I'm reproducing the results in the repo. I found that the vqa_train annotation files differ from the original VQAv2 annotations. There are some answers in vqa_tr…
-
# ComfyUI Error Report
## Error Details
- **Node Type:** Qwen2_VQA
- **Exception Type:** TypeError
- **Exception Message:** can only concatenate str (not "list") to str
-
Hope to access the 2D VQA and Image-Text Retrieval Task
-
I could not access the dataset. could you please provide more instructions since it is not uploaded in English portal?