Open Cece1031 opened 4 months ago
We detail our experiments on Music AVQA in Appendix (released soon). Briefly, we do not use in-context reasoning, but follow the baseline recognition paradigm for classification.
I wonder which split you used for evaluation on the Music AVQA dataset: test or val? Thanks!
I didn't quite understand how you compare with the prior model. Could you tell me your email so I can ask about this in detail? Thank you very much.