Open sunyuhan19981208 opened 12 months ago
I got only 9.7% for llama2-7B-chat on human-eval using your script
{'pass@1': 0.0975609756097561}
Hi, I think you will have to make sure the prompt template is correct
I got only 9.7% for llama2-7B-chat on human-eval using your script