MMMU-Benchmark / MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

https://mmmu-benchmark.github.io/

Apache License 2.0

327 stars, 21 forks

issues

Update infer_llava_onevision.py

#36 Zheng0428 closed 1 week ago
0
Update README.md

#35 Zheng0428 closed 1 week ago
0
Update README.md

#34 Zheng0428 closed 1 week ago
0
Enquiry about the usage of your dataset

#33 tunantu closed 1 week ago
1
add mmmu pro

#32 Zheng0428 closed 1 week ago
0
add mmmu-pro

#31 Zheng0428 closed 2 weeks ago
0
Add validation set to EvalAI

#30 dchichkov opened 1 month ago
3
ls

#29 yuanze-lin closed 1 month ago
0
.tsv file

#28 beichenzbc closed 2 months ago
0
validation_Materials_25 answer seems wrong?

#27 Zarjagen closed 3 months ago
2
Update Skywork-VL

#26 yu-changqian closed 3 months ago
1
How to convert images and prompt to the HF parquet?

#25 Gumpest closed 3 months ago
2
GPT4o

#24 dirtycomputer closed 4 months ago
3
Can you release more result files from the validation leaderboard?

#23 kyleliang919 closed 4 months ago
1
There's an error in one of the Correct Examples for genetics

#22 eabase closed 4 months ago
3
Fail to connect to the homepage: https://mmmu-benchmark.github.io/

#21 shannany0606 closed 5 months ago
2
No (supported) data files found in /MMMU/MMMU

#20 Xiaolong-RRL closed 6 months ago
3
GPT-4V refuses to answer / insists on "I'm sorry, but I'm unable to view images" and similar responses

#19 SweetGUOguo closed 6 months ago
2
RuntimeError: The size of tensor a (162) must match the size of tensor b (7) at non-singleton dimension 1

#18 nrikoh closed 4 months ago
17
PNG files do not convert to RGB

#17 y-vectorfield closed 7 months ago
4
Question about the process_single_sample function

#16 bruceisme closed 7 months ago
2
Request for answer_dict.json for test and dev

#15 boxin-wbx closed 8 months ago
1
Link for the open source methods in Leaderboard

#14 XiongweiWu closed 4 months ago
1
Image and JSON dataset.

#13 sxj1215 closed 8 months ago
6
How was "prompt engineering" performed?

#12 mckinziebrandon closed 9 months ago
2
Mismatch of the data label in Eval code

#11 XiongweiWu closed 9 months ago
1
Representing LLaVa-1.5-13b

#10 teasgen closed 9 months ago
7
Model Evaluation

#9 Rubics-Xuan closed 9 months ago
11
Question about "Text as Input"

#8 fxmeng closed 9 months ago
2
Why is every answer in Structural Engineering just "?"

#7 mckinziebrandon closed 9 months ago
1
How are the image types defined and labeled?

#6 CCYChongyanChen closed 9 months ago
2
Prompts

#5 teasgen closed 9 months ago
3
Error reports when loading the dataset

#4 XiongweiWu closed 9 months ago
4
model evaluation

#3 mactavish91 closed 9 months ago
2
Update README.md

#2 eltociear closed 9 months ago
1
Evaluation Prompt for mPLUG-Owl2

#1 vateye closed 9 months ago
8