[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Could you clarify the main differences between the new dataset and the previous one? Are there any significant changes in the data format, or anything else that I should be aware of?
While reviewing the project contents, I noticed that the files 'YI_VL_Evaluator.py' and 'YI_Chat_Evaluator.py' appear to be missing. Could you provide an example or let me know where I can find them? Thank you for your assistance and support.
We adjusted the data format, mainly by changing key names, which has no impact on usage. For the images, we now provide local copies instead of links.
Experts rewrote some of the previous open-ended problems that could not be assessed automatically, so the open-ended problems can now be evaluated automatically. The files are also split according to category. The inference code takes all of the above changes into account.
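For anyone with scripts written against the old format, here is a rough sketch of what adapting a record might look like. It is only an illustration, not code from this repository: the key names in `KEY_MAP`, the `images` field, and the `images` directory are all hypothetical placeholders, since the actual renamed keys are not listed in this thread.

```python
import os

# Hypothetical old-to-new key mapping; the real key names are not given in this thread.
KEY_MAP = {"problem": "question", "answer": "final_answer"}


def migrate_record(record, key_map=KEY_MAP, image_dir="images"):
    """Rename keys in one record and swap image links for local file paths."""
    # Keys missing from the mapping are kept unchanged.
    migrated = {key_map.get(key, key): value for key, value in record.items()}
    # Assumed field: a list of image URLs. Keep only the file name and
    # point it at a local directory instead of the remote link.
    if "images" in migrated:
        migrated["images"] = [
            os.path.join(image_dir, os.path.basename(url))
            for url in migrated["images"]
        ]
    return migrated
```

Check the released data files for the real key names before relying on a mapping like this.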
We will update these two code files later. Please stay tuned.