IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
261 stars 61 forks source link

Closes #270 | Create dataset loader for NERGrit #295

Closed cahya-wirawan closed 1 year ago

cahya-wirawan commented 1 year ago

Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.

Checkbox

cahya-wirawan commented 1 year ago

This nergrit dataset is more comprehensive than the indonlu_nergrit dataset. It also contains three subset datasets NER, Sentiment, and Statement. Following is the comparison:

indonlu_nergrit:

nergrit:

SamuelCahyawijaya commented 1 year ago

/test dataset=nergrit subset_id=nergrit_ner

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3163009802

SamuelCahyawijaya commented 1 year ago

/test dataset=nergrit subset_id=nergrit_sentiment

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3163027504

SamuelCahyawijaya commented 1 year ago

/test dataset=nergrit subset_id=nergrit_statement

github-actions[bot] commented 1 year ago

Run result

Check test log here: https://github.com/IndoNLP/nusa-crowd/actions/runs/3163033929