AILab-CVC / SEED-Bench

(CVPR 2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Update for mPLUG-Owl #3

Open MAGAer13 opened 11 months ago

MAGAer13 commented 11 months ago

Thanks for the excellent evaluation work! The mPLUG-Owl results seem to be from the initial release version. We have since trained the latest version on more image-text pairs, and it shows promising results on MMBench. Would you like to evaluate it?

We will share the results with you and release the checkpoint of the latest version.

geyuying commented 10 months ago

Thank you for your interest in SEED-Bench.

We have released the SEED-Bench leaderboard at https://huggingface.co/spaces/AILab-CVC/SEED-Bench_Leaderboard, and you can update your models' results on the leaderboard by following our evaluation instructions.