Closed hicleo closed 4 months ago
Seems to have something to do with task_prompt_formatting Change the prompt as follows:
instruction="### Instruction: Summarize the following passage in 3 sentences.\n",
query=f"### Instruction: Summarize the following passage in 3 sentences.\n### Passage: {line['article']}\n### Summary: ",
And this issue can be fixed
Thanks so much for debugging, would you be OK with opening a PR to share this fix with the community?
Not sure if it has something to do with the evaluated model itself. When I use another finetuned model, the original code seems to be fine. Maybe adjusting the task_prompt_formatting ourselves according to requirements is needed.
When running the evaluation of Sheared-LLaMA-1.3B and original LLaMA-7B on helm|summarization:cnn-dm, I get zero scores:
Output: