hendrycks / math

The MATH Dataset (NeurIPS 2021)
MIT License
879 stars 85 forks source link

Wrap is_equiv in a Python package. #8

Closed hacobe closed 3 years ago

hacobe commented 3 years ago

I plan to submit a new metric to Hugging Face that will compute accuracy treating equivalent answers (as defined by is_equiv) as correct. Without this change, Hugging Face users would have to download the math directory and add it to their PYTHONPATH before using the new metric. Instead, they'll be able to install the package from github using pip.

As part of this change, I've also: