Closed madhavatreplit closed 1 year ago
Why
Recent updates to the bigcode-evaluation-harness broke our eval.py's compatibility with the library. Specifically, they added a few new arguments that are checked for downstream in the script.
We updated our script so that it runs with the latest version of the harness.
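As a rough illustration of this kind of breakage, here is a minimal sketch. It is not the harness's actual code: the argument names (`--tasks`, `--hypothetical_new_option`) and the helper functions are made up for the example. It only shows how a script built on an older argument parser can fail when newer downstream code reads an attribute the script never defined.

```python
import argparse


def build_parser(include_new_args: bool) -> argparse.ArgumentParser:
    """Build a toy parser; every argument name here is hypothetical."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--tasks", default="example_task")
    if include_new_args:
        # Stand-in for an argument introduced by a newer library version.
        parser.add_argument("--hypothetical_new_option", default=None)
    return parser


def downstream_check(args: argparse.Namespace) -> str:
    """Stand-in for library code that reads the newer argument."""
    if args.hypothetical_new_option is not None:
        return "custom"
    return "default"


# An outdated script never defines the new argument,
# so the downstream attribute read raises AttributeError.
old_args = build_parser(include_new_args=False).parse_args([])
try:
    downstream_check(old_args)
    old_script_failed = False
except AttributeError:
    old_script_failed = True

# A script rebuilt on the newer parser defines the argument,
# so the same downstream check succeeds.
new_args = build_parser(include_new_args=True).parse_args([])
new_script_result = downstream_check(new_args)
```

Rebasing the script on the library's current entry point, rather than patching in individual arguments, keeps the parser and the downstream checks in step.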
What changed
Updated evaluation/eval.py: applied our changes over the latest version of main.py from the harness.
Test plan
Ran and reproduced HE number.
Rollout