OpenThaiGPT / openthaigpt-pretraining

Apache License 2.0
21 stars 10 forks source link

Complete e2e Evaluation pipeline #182

Open ArthurMinovsky opened 1 year ago

ArthurMinovsky commented 1 year ago

Check list for evaluation pipeline

1. Automatic Evaluation Thai Tasks:

Sequence Classification

QA

Classification

Summarization

Translation

Token Classification

Expected outcome

  1. Convert all tasks above into huggingface dataset format and save to Lanta
  2. Translate all English tasks to Thai. Use mUSE or another encoder to improve translation results if necessary
  3. Create a prompt template using Prompt Source for all tasks listed above
  4. Write a script to evaluate all tasks (Huggingface evaluate library should help) and open PR

2. Automatic Evaluation English Tasks

pitikorn32 commented 1 year ago

Colab: https://colab.research.google.com/drive/13LkwsUX6bomzUdHB-LhLQXPp9qHdLgzv?usp=sharing