OpenGVLab / Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
419 stars 30 forks source link

MiniGPT-4 and LLaVA evaluation #5

Open waltonfuture opened 1 year ago

waltonfuture commented 1 year ago

Hi! I'm a fan of your work. Can you please provide more details about how to do eval for MiniGPT-4 and LLaVA on various datasets? Thanks a lot!

BellXP commented 1 year ago

Thank you for your attention, we have updated the details about using models and doing evaluation. The details about getting the datasets are still ongoing.