IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
261 stars 61 forks source link

Closes #35 | Create dataset loader for IndoNLU NERGrit #264

Closed cahya-wirawan closed 1 year ago

cahya-wirawan commented 1 year ago

Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.

Checkbox

christianwbsn commented 1 year ago

/test dataset=nergrit

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3059521401

cahya-wirawan commented 1 year ago

Hi @bryanwilie, thanks for approving this PR. However, @SamuelCahyawijaya and I agreed to change the name of the dataset to "IndonNLU Nergrit" (update will follow soon) because we have another NERGrit dataset https://github.com/IndoNLP/nusa-crowd/issues/270 (the dataset is different and contains 3 subset datasets) where I will also create its dataloader.

SamuelCahyawijaya commented 1 year ago

Hi Pak @cahya-wirawan, thank you for the update!

Due to a recent update in the master branch, we move all the source code from nusantara/ to nnusa_crowd/, would you help to merge with the master branch and move the dataset to the new folder?

On the other hand, is this PR ready to be reviewed?

cahya-wirawan commented 1 year ago

Ok, I will move to the new folder, and update the citation and License if it's necessary

SamuelCahyawijaya commented 1 year ago

/test dataset=indonlu_nergrit

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3160703619