A pipeline to improve skills of large language models
Added new benchmark AMC23, corrected an error in AIME-2024, and removed leading zeros from AIME-2024 for improved formatting #117
Closed
wedu-nvidia closed 1 month ago
I added the amc23 benchmark. I also fixed one error in aime-2024 for https://artofproblemsolving.com/wiki/index.php/2024_AIME_I_Problems/Problem_12: the correct answer should be 385, not 384. Finally, I removed leading zeros from aime-2024 for better formatting.
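For illustration only, the leading-zero cleanup could look roughly like the sketch below. It assumes the answers are stored as strings of digits (for example "073"); the function name `strip_leading_zeros` and the sample values are hypothetical and are not taken from this PR's diff.

```python
def strip_leading_zeros(answer: str) -> str:
    """Normalize an integer answer string, e.g. "073" -> "73".

    Hypothetical helper: assumes the answer is a string of digits.
    """
    stripped = answer.strip().lstrip("0")
    # An all-zero answer such as "000" must stay "0", not become empty.
    return stripped or "0"


# Sample values chosen for illustration, not copied from the dataset.
examples = ["073", "385", "000", "5"]
normalized = [strip_leading_zeros(a) for a in examples]
print(normalized)
```

Normalizing at the data level this way would let a plain string comparison match a model output of "73" against an expected answer that was originally written as "073".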