IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
260 stars 62 forks source link

Closes #224 | Create dataloader for Korpus Nusantara corpus #239

Closed yana-xuyan closed 2 years ago

yana-xuyan commented 2 years ago

Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.

Checkbox

yana-xuyan commented 2 years ago

The Updated progress bar commit is made by "README-bot". I don't really know why it appears automatically. :(

yana-xuyan commented 2 years ago

Hi, thank you for checking the code. For the first point, I gonna revise the code. For the second point, actually tiociu and khek don't have the corresponding ISO code, so I left them as they are. I also have no idea about both language but seems they are dialects in Indonesia.