SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
68 stars 57 forks source link

Create dataset loader for CreoleRC #222

Closed SamuelCahyawijaya closed 8 months ago

SamuelCahyawijaya commented 10 months ago

Dataloader name: creole_rc/creole_rc.py DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?creole_rc

Dataset creole_rc
Description CreoleRC is a subset created by the CreoleVal paper. Relation classification (RC) aims to identify semantic associations between entities within a text, essential for applications like knowledge base completion and question answering. The dataset is sourced from Wikipedia and manually annotated. CreoleRC contains 5 creoles, but SEACrowd is interested specifically in the Chavacano subset.
Subsets -
Languages cbk
Tasks Relation Extraction
License Creative Commons Attribution Share Alike 4.0 (cc-by-sa-4.0)
Homepage https://github.com/hclent/CreoleVal/tree/main/nlu/relation_classification
HF URL -
Paper URL https://arxiv.org/abs/2310.19567
sedrickkeh commented 10 months ago

self-assign

github-actions[bot] commented 10 months ago

Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.

sedrickkeh commented 10 months ago

Working on it. Will try to finish this week

github-actions[bot] commented 9 months ago

Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.

zwenyu commented 8 months ago

self-assign