meta-math / MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
https://meta-math.github.io
Apache License 2.0

Are the MetaMath eval results valid? E.g., what is the standard way to evaluate on MATH? #33

Open brando90 opened 4 months ago

brando90 commented 4 months ago

I see there is code here for that, but is it really the standard way to evaluate on MATH? How did you make sure your results were valid if you didn't use a standard, tested eval for MATH?
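
For context, here is a minimal sketch of what an exact-match check on MATH commonly looks like. It assumes the model's final answer appears in a `\boxed{...}` expression, which is how the dataset's reference solutions mark their final answers. The function names and the normalization rules are hypothetical and intentionally light; this is not the script in this repository, and a real evaluation would likely need more careful answer-equivalence handling.

```python
from typing import Iterable, Optional, Tuple


def last_boxed_answer(text: str) -> Optional[str]:
    """Return the contents of the last \\boxed{...} in text, or None if absent."""
    marker = "\\boxed{"
    start = text.rfind(marker)
    if start == -1:
        return None
    depth = 1  # we are inside the opening brace of \boxed{
    chars = []
    for ch in text[start + len(marker):]:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                # Matching close of \boxed{ reached.
                return "".join(chars)
        chars.append(ch)
    return None  # braces never balanced


def normalize(answer: str) -> str:
    """Hypothetical, light normalization; an assumption, not an official rule set."""
    answer = answer.strip().rstrip(".")
    answer = answer.replace("$", "").replace(" ", "")
    answer = answer.replace("\\dfrac", "\\frac").replace("\\tfrac", "\\frac")
    return answer


def is_correct(model_output: str, reference_solution: str) -> bool:
    """Exact string match between normalized boxed answers."""
    pred = last_boxed_answer(model_output)
    gold = last_boxed_answer(reference_solution)
    if pred is None or gold is None:
        return False
    return normalize(pred) == normalize(gold)


def accuracy(pairs: Iterable[Tuple[str, str]]) -> float:
    """Fraction of (model_output, reference_solution) pairs judged correct."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    return sum(is_correct(out, ref) for out, ref in pairs) / len(pairs)
```

Because the score depends on both how the answer is extracted and how it is normalized, two scripts can report different accuracies for the same model outputs, which is why I am asking which procedure was used.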