IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
261 stars, 61 forks

Create dataset loader for ID Abusive Online News Comment #226

Closed by SamuelCahyawijaya 1 year ago

SamuelCahyawijaya commented 2 years ago

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?id_abusive_news_comment

Dataset: id_abusive_news_comment
Description: The dataset consists of comments on some of the top news stories of 2019, collected from several online news sites and forums, such as Kompas, Kaskus, and Detik. A total of 10 annotators labeled the data, and each comment was annotated by 3 of them. Each comment was assigned one of the following labels: not abusive, abusive but not offensive, or abusive and offensive.
License: Unknown

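For whoever picks up the loader, here is a minimal sketch of the label schema described above. The three label names come from the description; the integer ids, the `LABEL2ID` mapping, and the `majority_label` helper are hypothetical and not part of the dataset or of nusa-crowd. The description does not say how the 3 annotations per comment are combined, so majority voting is only an assumption here.

```python
from collections import Counter

# Label names as listed in the dataset description.
# The integer ids are an assumption made for illustration.
LABELS = ["not abusive", "abusive but not offensive", "abusive and offensive"]
LABEL2ID = {name: idx for idx, name in enumerate(LABELS)}


def majority_label(votes):
    """Return the label chosen by most annotators, or None on a tie.

    Hypothetical helper: majority voting is assumed, not confirmed
    by the dataset description.
    """
    unknown = [v for v in votes if v not in LABEL2ID]
    if unknown:
        raise ValueError(f"Unknown label(s): {unknown}")
    counts = Counter(votes).most_common()
    # With 3 annotators and 3 labels, a 1-1-1 split is possible.
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None
    return counts[0][0]


if __name__ == "__main__":
    votes = ["not abusive", "abusive and offensive", "abusive and offensive"]
    print(majority_label(votes), LABEL2ID[majority_label(votes)])
```

If the released files already contain a single aggregated label per comment, only the label list would be needed and the voting helper could be dropped.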
wenliangdai commented 2 years ago

self-assign