IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0

Closes #224 | Create dataloader for Korpus Nusantara corpus #259

yana-xuyan closed 2 years ago

yana-xuyan commented 2 years ago

Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.
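As a rough illustration of the naming convention above, the check below extracts the issue number from a PR title that starts with "Closes #ISSUE-NUMBER". This is only a sketch: the function name `closing_issue_number` and the regular expression are hypothetical and are not part of the nusa-crowd tooling.

```python
import re
from typing import Optional

# Hypothetical pattern for the "Closes #ISSUE-NUMBER" prefix (an assumption,
# not taken from the repository's own checks).
PR_TITLE_PATTERN = re.compile(r"^Closes #(\d+)\b")


def closing_issue_number(title: str) -> Optional[int]:
    """Return the issue number a PR title claims to close, or None."""
    match = PR_TITLE_PATTERN.match(title)
    return int(match.group(1)) if match else None


# Example using this PR's title.
print(closing_issue_number("Closes #224 | Create dataloader for Korpus Nusantara corpus"))
```

A title that does not begin with the prefix yields `None`, which a reviewer could treat as a prompt to rename the PR.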


yana-xuyan commented 2 years ago

@SamuelCahyawijaya I'm pasting the chat history below: @yana-xuyan : Thank you for your contribution! Since there is no ISO code for "tiociu" and "khek", let's keep them as they are.

On the other hand, there seem to be some conflicting files in your commit. Can you merge the latest update from the master branch into your branch and push again? Thank you!

SamuelCahyawijaya commented 2 years ago

/test dataset=korpus_nusantara subset_id=korpus_nusantara_ind_day_nusantara_t2t

github-actions[bot] commented 2 years ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071573252

github-actions[bot] commented 2 years ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071574522

SamuelCahyawijaya commented 2 years ago

/test dataset=korpus_nusantara subset_id=korpus_nusantara_ind_day

github-actions[bot] commented 2 years ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071611779

SamuelCahyawijaya commented 2 years ago

/test dataset=korpus_nusantara subset_id=korpus_nusantara_khek_ind

github-actions[bot] commented 2 years ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3071623556

SamuelCahyawijaya commented 2 years ago

@yana-xuyan : Thanks for contributing. The dataset looks good to me! Approving this PR!