OpenGVLab / Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

Code for VCR evaluation #8

Open dongmean opened 11 months ago

dongmean commented 11 months ago

First, I really appreciate your great contributions to the LVLM field.

Do you have any plans to release the visual commonsense reasoning (VCR) evaluation code? There are instructions for locating and downloading the dataset, but I couldn't find the corresponding evaluation code.

Thanks again for your work.

BellXP commented 10 months ago

Thank you for your question. The code for VCR evaluation has been updated in LVLM_evaluation/Multi_turn_Reasoning.