Closed jchsun1 closed 3 hours ago
Hi,
Thank you for your interest.
The MLVU dataset consists of two sets: the dev set and the test set. The multiple-choice questions in MLVU_Test include 6 options to increase the difficulty. Additionally, the test set does not provide ground truth to ensure a fairer evaluation. You can refer to the example at https://github.com/JUNJIE99/MLVU/blob/main/evaluation_test/test_res.json to organize your prediction results and submit the result file to us for evaluation.
The repository contains evaluation code for both MLVU(dev) and MLVU_Test.
Since ground truth is not provided for the MLVU Test set, please organize your prediction results according to the example at https://github.com/JUNJIE99/MLVU/blob/main/evaluation_test/test_res.json and submit them to us for evaluation.
Thank you.
Thanks for you quick reply and guidance, I can solve my problems now.
Hello, thanks for your excellent work and I have some questions here:
I'm looking forward to your answers. Thank you!