OpenThaiGPT / openthaigpt-pretraining

Apache License 2.0
21 stars, 10 forks

Test quality of data for each eval task #272

Open ArthurMinovsky opened 1 year ago

ArthurMinovsky commented 1 year ago

Hugging Face dataset (filtered)

Dataset filtered by mUSE score
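
The issue does not say how the mUSE score is computed or which cutoff is used. As a rough sketch only, the snippet below assumes the score is a cosine similarity between two sentence embeddings per record and that rows below a threshold are dropped. The field names (`source_vec`, `target_vec`), the helper `filter_by_score`, the threshold of 0.5, and the toy vectors are all hypothetical stand-ins, not taken from the repository.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_a * norm_b)


def filter_by_score(records, threshold=0.5):
    """Keep records whose embedding similarity meets the threshold.

    Assumption: each record carries two precomputed embeddings.
    The threshold value is illustrative, not the project's setting.
    """
    kept = []
    for record in records:
        score = cosine_similarity(record["source_vec"], record["target_vec"])
        if score >= threshold:
            kept.append({**record, "score": score})
    return kept


# Toy 2-dimensional vectors standing in for real sentence embeddings.
records = [
    {"id": 1, "source_vec": [1.0, 0.0], "target_vec": [0.9, 0.1]},
    {"id": 2, "source_vec": [1.0, 0.0], "target_vec": [0.0, 1.0]},
]
kept = filter_by_score(records, threshold=0.5)
print([r["id"] for r in kept])
```

Storing the score alongside each kept record makes it possible to inspect borderline rows per eval task and adjust the cutoff later.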