aurooj / WSG-VQA-VLTransformers

Weakly Supervised Grounding for VQA in Vision-Language Transformers
MIT License
16 stars 2 forks source link