I evaluated LVChat on Egoschema, but the accuracy is below 0.3. I have tried max_num_frm={16, 96, 160}, and all the other hyper-parameters are set as default in inferency.py, but the accuracy is almost the same.
Meanwhile, the accuracy of VideoChat2 is about 0.45, and the max_num_frm={16, 96}.
I evaluated LVChat on Egoschema, but the accuracy is below 0.3. I have tried max_num_frm={16, 96, 160}, and all the other hyper-parameters are set as default in inferency.py, but the accuracy is almost the same.
Meanwhile, the accuracy of VideoChat2 is about 0.45, and the max_num_frm={16, 96}.