IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0

Create dataset loader for IndoSRL #341

Open SamuelCahyawijaya opened 1 year ago

SamuelCahyawijaya commented 1 year ago

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?indosrl

Dataset indosrl
Description IndoSRL is a semantic role labeling (SRL) dataset for Indonesian. It provides the complete predicate-argument structure for every predicate in a sentence, with the SRL annotations layered on top of the Indonesian GSD corpus from Universal Dependencies. The dataset can be used for four tasks: predicate identification, predicate sense disambiguation, argument identification, and argument classification. The data is available in CoNLL-U Plus (conllup) format.
License CDLA-Permissive 1.0
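
Since the data ships in conllup format, a loader will first need to read that format. Below is a minimal sketch of such a reader, not the project's actual loader. It assumes the file begins with a `# global.columns = ...` header (as CoNLL-U Plus files do) and that fields are tab-separated. The column names `SRL:PRED` and `SRL:ARG` and the sample sentence are hypothetical and chosen only for illustration; the real IndoSRL column layout should be checked against the files themselves.

```python
from typing import Dict, Iterable, Iterator, List

Token = Dict[str, str]


def read_conllup(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield one sentence at a time as a list of {column name: value} dicts."""
    columns = None
    sentence: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("# global.columns ="):
            # The header names the columns used by every token line.
            columns = line.split("=", 1)[1].split()
            continue
        if line.startswith("#"):
            # Other comment lines (e.g. "# text = ...") are skipped here.
            continue
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if columns is None:
            raise ValueError("missing '# global.columns' header")
        sentence.append(dict(zip(columns, line.split("\t"))))
    if sentence:
        # Flush the last sentence if the input has no trailing blank line.
        yield sentence


# Hypothetical sample; the SRL column names are assumptions, not IndoSRL's.
SAMPLE = (
    "# global.columns = ID FORM UPOS SRL:PRED SRL:ARG\n"
    "# text = Saya makan nasi\n"
    "1\tSaya\tPRON\t_\tARG0\n"
    "2\tmakan\tVERB\tmakan.01\t_\n"
    "3\tnasi\tNOUN\t_\tARG1\n"
    "\n"
)

sentences = list(read_conllup(SAMPLE.splitlines()))
# Predicate identification: tokens whose predicate column is not the "_" filler.
predicates = [tok["FORM"] for tok in sentences[0] if tok["SRL:PRED"] != "_"]
```

Keeping each token as a dictionary keyed by column name means the same reader works whatever columns the header declares, so the four tasks can each pick the columns they need.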