Open yqy2001 opened 2 weeks ago
Hi, I remember that there is an image replacement issue for ai2d during our development raised by the community. I think at the end we follow the pipeline of the ai2d and perform text replacement.
I took into deep look and got confused on AI2D's data.
Here's a check from another evaluation suite and widely used in the development of InternVL-1.5.
https://github.com/OpenGVLab/InternVL/tree/main/internvl_chat#ai2d-test
I downloaded their AI2D_TEST data and found the image is the same as ours.
confusing +1
Hello! I examined the AI2D dataset used for evaluation and found that a portion of them are unreasonable, suggesting errors in the replacement of options (A, B, ...).
Could you fix this and share your replacement strategy?
Thank you.
Examples: