TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
Apache License 2.0
133 stars 22 forks source link

Why dont use chat template for chat model? #15

Closed eyuansu62 closed 2 months ago

eyuansu62 commented 3 months ago

It is an uncommon approach to directly input a prompt string into a chat model without using a chat template, which may result in unexpected behavior.

wenhuchen commented 2 months ago

We tried with chat template, the performance is lower for few-shot learning. The chat template are mostly used in zero-shot for some models.