taturabe closed this pull request 7 months ago.
Hi, @danielezhu
It looks like this PR has not been merged yet because it is still awaiting review. Could you confirm its status?
Hi @taturabe, I've included your suggested change as part of my PR #192, which will be merged before we do a release (soon).
Understood! Thank you so much! @danielezhu
PR #192 has been merged, so I am closing this one.
Issue #, if available:
In this summarization accuracy evaluation example, the prompt template "Human: $feature\n\nAssistant:\n" is used. However, since this prompt contains no instruction to generate a one-sentence summary, the model's output is far from an actual summary.
As a way to exercise the evaluation module, this prompt is not problematic; in fact, the evaluation succeeds. However, it could be misinterpreted as implying that the SummarizationAccuracy class has an internally preset summarization instruction.
Description of changes:
This pull request makes it clear that when a built-in dataset is not used, the user must include an instruction in the prompt_template that matches the task. Alternatively, the user can leave prompt_template unset so that fmeval falls back to its default template (it is not clear to me whether the default is suitable for every task).
With this change, the model output becomes an appropriate one-sentence summary and the evaluation score improves, making the example more representative of actual use cases.
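The difference between the two templates can be sketched with plain string templating, since fmeval-style templates use the `$feature` placeholder shown above. Note that the instruction wording below is illustrative, not the notebook's exact text:

```python
from string import Template

# Template from the example: no summarization instruction, so the model
# may simply continue the text instead of summarizing it.
original = Template("Human: $feature\n\nAssistant:\n")

# Template with an explicit task instruction (hypothetical wording),
# asking the model for a one-sentence summary of the input.
with_instruction = Template(
    "Human: Summarize the following text in one sentence.\n\n"
    "$feature\n\nAssistant:\n"
)

article = "The quick brown fox jumps over the lazy dog near the river."

# Both templates substitute the dataset record into $feature; only the
# second tells the model what task to perform on it.
prompt = with_instruction.substitute(feature=article)
print(prompt)
```

When this instruction-bearing string is passed as the prompt_template, the rendered prompt carries the task description alongside each dataset record, rather than relying on the model to infer that a summary is wanted.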
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.