SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
65 stars 57 forks source link

Create dataset loader for Indoler #351

Closed SamuelCahyawijaya closed 7 months ago

SamuelCahyawijaya commented 8 months ago

Dataloader name: indoler/indoler.py DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?indoler

Dataset indoler
Description The dataset contains 993 annotated court decision documents. The document was taken from the Decision of the Supreme Court of Indonesia (https://decision3.mahkamahagung.go.id/) for 5 state courts of DKI Jakarta with a total of 1000 criminal documents cases.
Subsets -
Languages ind
Tasks Named Entiy Recognition
License Unknown (unknown)
Homepage https://github.com/ir-nlp-csui/indoler/tree/main
HF URL -
Paper URL https://ieeexplore.ieee.org/abstract/document/9263157
luckysusanto commented 8 months ago

self-assign