bigscience-workshop / data_tooling

Tools for managing datasets for governance and training.
Apache License 2.0
77 stars, 49 forks

Create dataset UIT-VSMEC #181

Closed: albertvillanova closed this issue 2 years ago

albertvillanova commented 2 years ago

DONE: https://huggingface.co/datasets/bigscience-catalogue-data/uit_vsmec

Sample:

{'sentence': 'cho mình xin bài nhạc tên là gì với ạ', 'emotion': 4}

lhoestq commented 2 years ago

self-assign

lhoestq commented 2 years ago

Done: https://huggingface.co/datasets/bigscience-catalogue-lm-data/lm_vi_uit_vsmec

sample:

{'text': 'cho mình xin bài nhạc tên là gì với ạ'}

albertvillanova commented 2 years ago

Thanks @lhoestq.
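
For readers comparing the two samples in this thread: the catalogue record carries a `sentence` and an integer `emotion` label, while the LM record carries only `text`. The thread does not show the conversion script, so the following is only a minimal sketch that assumes the LM version keeps the sentence verbatim and drops the label. The function name `to_lm_record` is hypothetical and not taken from the repository.

```python
from typing import Any, Dict


def to_lm_record(record: Dict[str, Any]) -> Dict[str, str]:
    """Map a labeled catalogue record to a text-only LM record.

    Assumption (not confirmed in the thread): the 'sentence' field is
    copied unchanged into 'text' and the 'emotion' label is discarded.
    """
    return {"text": record["sentence"]}


# The catalogue sample quoted earlier in the thread.
catalogue_sample = {
    "sentence": "cho mình xin bài nhạc tên là gì với ạ",
    "emotion": 4,
}

lm_sample = to_lm_record(catalogue_sample)
print(lm_sample)
```

With the Hugging Face `datasets` library, a function like this could plausibly be applied to a whole split via `Dataset.map`, passing `remove_columns` to drop the original `sentence` and `emotion` columns; whether the actual pipeline did so is not stated here.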