open-compass / MMBench

Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
Apache License 2.0
163 stars 10 forks source link

Confused by online results #4

Closed FeipengMa6 closed 1 year ago

FeipengMa6 commented 1 year ago

I submit the prediction result of test set. And the evaluation results also report the dev_overall score, what does that mean?

image