[Question] Question on Accuracy data of SeedBench

mu-cai / matryoshka-mm

Matryoshka Multimodal Models

https://matryoshka-mm.github.io/

Apache License 2.0

81 stars 4 forks source link

[Question] Question on Accuracy data of SeedBench #4

Open ChimpOnCloud opened 2 months ago

ChimpOnCloud commented 2 months ago

Question

Table 1 in this paper showed that the accuracy of SeedBench on vanilla LLaVA-1.5-7B is 60.5. However, when I tried to reproduce this one, I got 66.2 which is higher than that. No response

mu-cai commented 2 months ago

Thanks for your question. We reported the SEED accuracy on Total category, instead of image category.

You can also see this from lmms-eval official results. https://docs.google.com/spreadsheets/d/1a5ImfdKATDI8T7Cwh6eH-bEsnQFzanFraFUgcS9KHWc/edit?gid=0#gid=0

I believe if you run evaluations using lmms-eval for both llava-1.5-7b and M3-7b, you will arrive at the same conclusion.