IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
260 stars 61 forks source link

Create dataset loader for IndQNER #329

Closed SamuelCahyawijaya closed 1 year ago

SamuelCahyawijaya commented 1 year ago

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?indqner

Dataset indqner
Description IndQNER is a NER dataset created by manually annotating 8 chapters in the Indonesian translation of Quran text. The dataset consists of 2476 named entities from 18 categories. Each named entity is labeled using BIO (Beginning-Inside-Outside) tagging format.
License Unknown
RiaGusmita commented 1 year ago

self-assign

SamuelCahyawijaya commented 1 year ago

Closed in https://github.com/IndoNLP/nusa-crowd/pull/326