malteos / finetune-evaluation-harness

MIT License
2 stars 0 forks source link

Benchmark: Fine-tune and evaluate LMs #1

Closed malteos closed 1 year ago

malteos commented 1 year ago

Objective: We want to fine-tune language models (like our German GPT) on specific tasks and evaluate them.

Important: This approach is different from the zero-shot / few-shot evaluation as done in lm-evaluation-harness!

Other notes:

Tasks: