ZhangYuanhan-AI closed 2 months ago
LLaVA-OV-0.5B: AI2D, MME
LLaVA-OV-0.5B: VideoMME (w/o subtitles), MLVU
All results match those reported in the papers.
@Luodian
Why sample uniformly? Don't we want it to be consistent across different videos?
Update video code