OpenGVLab / VideoMamba

VideoMamba: State Space Model for Efficient Video Understanding
https://arxiv.org/abs/2403.06977
Apache License 2.0

UMT_QA Usage #31

Closed NyleSiddiqui closed 2 months ago

NyleSiddiqui commented 2 months ago

Hi,

I have been working on taking VideoMamba's multi-modal pre-trained weights and applying them to other VQA downstream tasks/datasets. So far, I have been using the UMT_VideoMamba model, as this is the model compatible with the provided pre-trained weights, but I stumbled upon the UMT_QA model, which appears to be specifically tailored for VQA (i.e., it processes questions and candidate answers simultaneously and ranks the candidates), along with what appear to be outdated QA config files in the 'configs' folder. Before I spend time looking into this UMT_QA model, I just wanted to confirm whether it was ever used, or if it is deprecated. I was not able to find any references to the model in the repo besides its initialization, so I am assuming it was just a research idea that ended up not being used. Thanks in advance for your help!
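For context, here is a minimal, self-contained sketch of the ranking-style setup described above: one fused multimodal embedding per candidate answer is scored, and training uses cross-entropy over the candidates so the correct answer is ranked highest. The names (`AnswerRankingHead`, the tensor shapes) are illustrative assumptions, not taken from the repo.

```python
import torch
import torch.nn as nn

class AnswerRankingHead(nn.Module):
    """Scores each fused (video, question, candidate-answer) embedding;
    the model is trained to rank the correct answer highest."""
    def __init__(self, hidden_dim: int):
        super().__init__()
        self.scorer = nn.Linear(hidden_dim, 1)

    def forward(self, fused: torch.Tensor) -> torch.Tensor:
        # fused: [batch, num_candidates, hidden_dim] -- one fused
        # multimodal embedding per candidate answer.
        return self.scorer(fused).squeeze(-1)  # [batch, num_candidates]

# Hypothetical usage: 2 clips, 5 candidate answers each.
head = AnswerRankingHead(hidden_dim=768)
fused = torch.randn(2, 5, 768)
logits = head(fused)                    # [2, 5] candidate scores
labels = torch.tensor([3, 0])           # index of the correct candidate
loss = nn.functional.cross_entropy(logits, labels)
```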

Andy1621 commented 2 months ago

Hi! UMT is my previous paper, and this repo is built on top of it. I fine-tuned UMT for QA via UMT_QA. For VideoMamba, you may need to adapt the UMT_QA code into a corresponding UMT_VideoMamba_QA model.