Issues with IndustryOR Benchmark

Hi there! Thank you so much for pointing out these issues. We truly appreciate your feedback! We've recognized some problems in this benchmark and are currently working on a thorough review. A new version will be released soon.

The benchmark stems primarily from three sources: part of the content comes from textbook exercises, another part from well-known mathematical modeling competitions, and the rest from real-world operations research challenges faced by Cardinal Operations. We've made modifications to these problems to protect client privacy and ensure they fit within the window length limits of large language models. Additionally, many of the original problems and datasets were in Chinese, and we used AI translation to make them accessible to a broader scholar. This translation step may have contributed to the issues as well.

Once again, thank you for your valuable attention and feedback! We're committed to refining the English version to improve its accuracy and will release the updated version soon. Stay tuned!

Cardinal-Operations / ORLM

Issues with IndustryOR Benchmark #3