issues
search
MMMU-Benchmark
/
MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
https://mmmu-benchmark.github.io/
Apache License 2.0
327
stars
21
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update infer_llava_onevision.py
#36
Zheng0428
closed
1 week ago
0
Update README.md
#35
Zheng0428
closed
1 week ago
0
Update README.md
#34
Zheng0428
closed
1 week ago
0
Enquiry about the usage about your dataset
#33
tunantu
closed
1 week ago
1
add mmmu pro
#32
Zheng0428
closed
1 week ago
0
add mmmu-pro
#31
Zheng0428
closed
2 weeks ago
0
Add validation set to EvalAI
#30
dchichkov
opened
1 month ago
3
ls
#29
yuanze-lin
closed
1 month ago
0
.tsv file
#28
beichenzbc
closed
2 months ago
0
validation_Materials_25 answer seems wrong?
#27
Zarjagen
closed
3 months ago
2
Update Skywork-VL
#26
yu-changqian
closed
3 months ago
1
How to convert images and prompt to the HF parquet?
#25
Gumpest
closed
3 months ago
2
GPT4o
#24
dirtycomputer
closed
4 months ago
3
Can you release more result files from the validation leaderboard?
#23
kyleliang919
closed
4 months ago
1
There's an error in one of the Correct Examples for genetics
#22
eabase
closed
4 months ago
3
Fail to connect to the homepage: https://mmmu-benchmark.github.io/
#21
shannany0606
closed
5 months ago
2
No (supported) data files found in /MMMU/MMMU
#20
Xiaolong-RRL
closed
6 months ago
3
gpt4v refuse to answer/ insist on "I'm sorry, but I'm unable to view images" these kind of things
#19
SweetGUOguo
closed
6 months ago
2
RuntimeError: The size of tensor a (162) must match the size of tensor b (7) at non-singleton dimension 1
#18
nrikoh
closed
4 months ago
17
PNG files does not convert to RGB
#17
y-vectorfield
closed
7 months ago
4
process_single_sample function's question
#16
bruceisme
closed
7 months ago
2
Request for answer_dict.json for test and dev
#15
boxin-wbx
closed
8 months ago
1
Link for the open source methods in Leaderboard
#14
XiongweiWu
closed
4 months ago
1
Image and JSON dataset.
#13
sxj1215
closed
8 months ago
6
How was "prompt engineering" performed?
#12
mckinziebrandon
closed
9 months ago
2
Mismatch of the data label in Eval code
#11
XiongweiWu
closed
9 months ago
1
Representing LLaVa-1.5-13b
#10
teasgen
closed
9 months ago
7
Model Evaluatation
#9
Rubics-Xuan
closed
9 months ago
11
Question about "Text as Input"
#8
fxmeng
closed
9 months ago
2
Why is every answer in Structural Engineering just "?"
#7
mckinziebrandon
closed
9 months ago
1
How are the image types defined and labeled?
#6
CCYChongyanChen
closed
9 months ago
2
Prompts
#5
teasgen
closed
9 months ago
3
Error reports when loading the dataset
#4
XiongweiWu
closed
9 months ago
4
model evaluation
#3
mactavish91
closed
9 months ago
2
Update README.md
#2
eltociear
closed
9 months ago
1
Evaluation Prompt for mPLUG-Owl2
#1
vateye
closed
9 months ago
8