Closed: ganler closed this 5 months ago
This PR addresses #186 to include HumanEval+ in bigcode-evaluation-harness.
The dataset is available at https://huggingface.co/datasets/evalplus/humanevalplus
Thanks for the prompt code review! The comments have been addressed. :D
I also ran a test with starcoderbase-1b and got 11.5 pass@1, which is very close to what I previously got using EvalPlus :).
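For anyone comparing numbers across the two setups, here is a minimal sketch of how a pass@1 figure like this is typically computed. It assumes the standard unbiased pass@k estimator, with `n` samples generated per problem and `c` of them passing all tests. The names `pass_at_k` and `mean_pass_at_k` are hypothetical helpers for illustration, not the harness's or EvalPlus's actual API.

```python
import math


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem: 1 - C(n - c, k) / C(n, k).

    n: samples generated, c: samples that passed, k: the k in pass@k.
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("expected 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k failing samples, so any k-subset contains a pass.
        return 1.0
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems; results holds (n, c) per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With k = 1 this reduces to the fraction of passing samples per problem, averaged over all problems, so small differences between runs can come from sampling settings rather than from the dataset itself.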