paperswithlove / papers-we-read

3 stars 0 forks source link

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AG #36

Open runhani opened 1 month ago

runhani commented 1 month ago

역시 original을 먼저 읽었어야 했나?

image

image

그래서 data를 어떻게 모았다고?

stat

total art & design business science health & medicine human & social sci. tehc & eng.
validation 900
test 10500 1163 1428 2426 1752 947 2784

conclusions