issues
search
FudanDISC
/
ReForm-Eval
An benchmark for evaluating the capabilities of large vision-language models (LVLMs)
Apache License 2.0
32
stars
4
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Discrepancy between the paper and listed benchmarks in the repo
#8
zhimin-z
closed
8 months ago
1
What is `VQA-MT` dataset?
#7
zhimin-z
closed
9 months ago
0
What is `A-OKVQAR` dataset?
#6
zhimin-z
closed
9 months ago
0
What is `A-OKVQRA` dataset?
#5
zhimin-z
closed
9 months ago
0
What is `K-ViQuAE` dataset?
#4
zhimin-z
closed
9 months ago
0
Any evaluation of closed source models?
#3
zhimin-z
opened
10 months ago
0
[Usage] This image does not exist, please check
#2
yongxinwang-ai
closed
8 months ago
1
The test results of lynx on the MSCOCO ITM task are questionable
#1
OPilgrim
opened
10 months ago
4