Test quality of data for each eval task - Githubissues

OpenThaiGPT / openthaigpt-pretraining

Apache License 2.0

21 stars 10 forks source link

Test quality of data for each eval task #272

Open ArthurMinovsky opened 1 year ago

ArthurMinovsky commented 1 year ago

Huggingface dataset (filtered)

filtered dataset from mUSE score

[ ] [Patt/HellaSwag_TH_drop](https://huggingface.co/datasets/Patt/HellaSwag_TH_drop) → The row that any score < 0.5 was dropped
[ ] [**Patt/MultiRC_TH_drop](https://huggingface.co/datasets/Patt/MultiRC_TH_drop)** → The row that any score < 0.66 was dropped.
[ ] [**Patt/RTE_TH_drop](https://huggingface.co/datasets/Patt/RTE_TH_drop)** → The row which score_hypothesis <= 0.5 or score_premise <= 0.7 was dropped.
[ ] [Patt/ReCoRD_TH_drop](https://huggingface.co/datasets/Patt/ReCoRD_TH_drop) → Drop every row that score_answers < 0.8 and every row that score < 0.5 after penalty.