test new model JiuZhang 3.0

open-compass / MathBench

[ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset

Apache License 2.0

66 stars 1 forks source link

Thanks so much for the exciting benchmark, and I believe it would be an important resource for the research community.

By the way, we have just trained a LLM specially for math via training a data synthesis model, namely JiuZhang3.0. I have attached the download link below, and would you mind testing it on your benchmark? We have released the checkpoints of the 7B and 8X7B versions:

The 7B version based on Mistral-7B: https://huggingface.co/ToheartZhang/JiuZhang3.0-7B
The MOE version based on Mixtral-8X7B: https://huggingface.co/ToheartZhang/JiuZhang3.0-8x7B

open-compass / MathBench

test new model JiuZhang 3.0 #18