AILab-CVC / SEED-Bench

(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
Other
276 stars 8 forks source link

Evaluating latest version of OpenFlamingo #2

Closed anas-awadalla closed 10 months ago

anas-awadalla commented 11 months ago

Congrats on the awesome work!

Similar to #1, it seems that you are evaluating the now deprecated version of OpenFlamingo. We have a much better model now. It would be cool to see how that one compares to the others.

geyuying commented 10 months ago

Thank you for your attention to our SEED-Bench.

We have released SEED-Bench leaderboard in https://huggingface.co/spaces/AILab-CVC/SEED-Bench_Leaderboard and you can update the results of your models in the leaderboard by following our evaluation instructions.