OpenThaiGPT / openthaigpt-pretraining

Apache License 2.0

Running Eval Task by each model #271

Open ArthurMinovsky opened 1 year ago

ArthurMinovsky commented 1 year ago

To do

ArthurMinovsky commented 1 year ago

Thai dataset

ArthurMinovsky commented 1 year ago

Hugging Face datasets (filtered)

Datasets filtered by mUSE score

[Patt/HellaSwag_TH_drop](https://huggingface.co/datasets/Patt/HellaSwag_TH_drop)

→ Rows with any score < 0.5 were dropped.

[Patt/MultiRC_TH_drop](https://huggingface.co/datasets/Patt/MultiRC_TH_drop)

→ Rows with any score < 0.66 were dropped.

[Patt/RTE_TH_drop](https://huggingface.co/datasets/Patt/RTE_TH_drop)

→ Rows with score_hypothesis <= 0.5 or score_premise <= 0.7 were dropped.

[Patt/ReCoRD_TH_drop](https://huggingface.co/datasets/Patt/ReCoRD_TH_drop)

→ Rows with score_answers < 0.8 were dropped, as were rows with score < 0.5 after the penalty.
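For reference, the threshold rules above can be sketched in plain Python. This is only an illustration of the drop logic, not the script actually used to build the datasets. `score_hypothesis` and `score_premise` come from the RTE rule above; `score_ctx` and `score_ending` are hypothetical column names standing in for whatever per-field mUSE scores the HellaSwag data carries. The ReCoRD penalty step is left out because its definition is not given here.

```python
from typing import Iterable, Mapping

Row = Mapping[str, float]


def keep_if_all_scores_at_least(row: Row, score_keys: Iterable[str], threshold: float) -> bool:
    """Keep a row only if none of the listed scores is below the threshold."""
    return all(row[key] >= threshold for key in score_keys)


def keep_rte_row(row: Row) -> bool:
    """RTE rule: drop when score_hypothesis <= 0.5 or score_premise <= 0.7."""
    return row["score_hypothesis"] > 0.5 and row["score_premise"] > 0.7


# Toy RTE rows (made-up scores, for illustration only).
rte_rows = [
    {"score_hypothesis": 0.9, "score_premise": 0.8},  # kept
    {"score_hypothesis": 0.5, "score_premise": 0.9},  # dropped: hypothesis <= 0.5
    {"score_hypothesis": 0.8, "score_premise": 0.7},  # dropped: premise <= 0.7
]
rte_kept = [row for row in rte_rows if keep_rte_row(row)]

# Toy HellaSwag rows; the column names are hypothetical.
hellaswag_rows = [
    {"score_ctx": 0.6, "score_ending": 0.5},  # kept: no score below 0.5
    {"score_ctx": 0.9, "score_ending": 0.4},  # dropped: one score below 0.5
]
hellaswag_kept = [
    row
    for row in hellaswag_rows
    if keep_if_all_scores_at_least(row, ["score_ctx", "score_ending"], 0.5)
]
```

The same `keep_if_all_scores_at_least` helper would cover the MultiRC rule by passing 0.66 as the threshold. Note the boundary difference: the "any score < 0.5" rules keep a score of exactly 0.5, while the RTE rule drops it.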

Pattptr commented 1 year ago

Dataset checklist

Thai

EN -> TH dataset (awaiting quality check)