open-compass / MathBench

[ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset
https://open-compass.github.io/MathBench/
Apache License 2.0

test new model JiuZhang 3.0 #18

Open Lancelot39 opened 2 months ago

Lancelot39 commented 2 months ago

Thanks so much for this exciting benchmark. I believe it will be an important resource for the research community.

By the way, we have just trained JiuZhang3.0, an LLM specialized for math, by training a data synthesis model. I have attached the download link below. Would you mind testing it on your benchmark? We have released checkpoints for the 7B and 8X7B versions:

liushz commented 2 months ago

Thanks for your interest in MathBench. We have noticed your model's impressive performance in mathematics, and we will be glad to test it on MathBench in the coming days.