Closed arjunguha closed 1 year ago
The SantaCoder FIM evaluation with MultiPL-E uses exact match. We should also execute the generated code. The dataset is here:
https://huggingface.co/datasets/bigcode/santacoder-fim-task
All that is needed is to execute item['prefix'] + generated_solution + item['suffix'] + item['tests'].
I recommend supporting n samples per item.
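A minimal sketch of what this could look like, assuming the item is a Python problem and that a program counts as passing when it exits with status 0. The names run_program and evaluate_item, the timeout value, and the temporary file name are hypothetical and not part of MultiPL-E; only the 'prefix', 'suffix', and 'tests' fields come from the description above.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_program(program: str, timeout_s: float = 10.0) -> bool:
    """Return True if the program exits with status 0 within the timeout.

    Hypothetical helper: assumes the program is Python source code.
    """
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "candidate.py"
        path.write_text(program, encoding="utf-8")
        try:
            result = subprocess.run(
                [sys.executable, str(path)],
                capture_output=True,
                timeout=timeout_s,
            )
        except subprocess.TimeoutExpired:
            # Treat a hung program as a failure.
            return False
        return result.returncode == 0


def evaluate_item(item: dict, completions: list[str]) -> list[bool]:
    """Run each of the n generated completions for one item.

    The program is assembled exactly as described above:
    prefix + generated_solution + suffix + tests.
    """
    return [
        run_program(item["prefix"] + completion + item["suffix"] + item["tests"])
        for completion in completions
    ]
```

Returning one boolean per sample keeps the n results for each item separate, so they can be aggregated afterwards (for example, into a pass rate). Generated code is untrusted, so a real harness would likely want to run it in a sandbox or container rather than directly on the host.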