GeWu-Lab / MWAFM

Multi-Scale Attention for Audio Question Answering
27 stars 1 forks source link

Meaning of metadata internal file #2

Closed zsw111-zzz closed 1 year ago

zsw111-zzz commented 1 year ago

Thanks a lot for your great contribution, but I'm having some issues reproducing your work. For the metadata part in the code base, does the source data contained inside come from the Clotho-AQA dataset or the AQA-MUSIC-AVQA dataset? Do the files starting with single_word, binary_test, clothho_aqa and other flags have any special meaning? Looking forward to your reply, thank you again!

ayameyao commented 1 year ago

We examined the original annotation files of Clotho-AQA and found that the official open-source annotations were not cleansed, resulting in discrepancies where different annotators provided different answers for the same question. As a result, we performed a simple filtering process where we considered a question to have the correct answer if it had at least two identical answers Based on this filtering process, we obtained a new and more accurate annotation file. The files in 'metadata' folder are described as follows

We have updated the detailed description in the 'readme.md' file, please check it.

zsw111-zzz commented 1 year ago

Your reply perfectly solved my problem, thank you very much for your reply!