FudanDISC ReForm-Eval issues - Githubissues

FudanDISC / ReForm-Eval

An benchmark for evaluating the capabilities of large vision-language models (LVLMs)

Apache License 2.0

32 stars 4 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Discrepancy between the paper and listed benchmarks in the repo

#8 zhimin-z closed 8 months ago
1
What is `VQA-MT` dataset?

#7 zhimin-z closed 9 months ago
0
What is `A-OKVQAR` dataset?

#6 zhimin-z closed 9 months ago
0
What is `A-OKVQRA` dataset?

#5 zhimin-z closed 9 months ago
0
What is `K-ViQuAE` dataset?

#4 zhimin-z closed 9 months ago
0
Any evaluation of closed source models?

#3 zhimin-z opened 10 months ago
0
[Usage] This image does not exist, please check

#2 yongxinwang-ai closed 8 months ago
1
The test results of lynx on the MSCOCO ITM task are questionable

#1 OPilgrim opened 10 months ago
4