SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0

Create dataset loader for VSoLSCSum #580

Closed SamuelCahyawijaya closed 2 months ago

SamuelCahyawijaya commented 3 months ago

Dataloader name: vsolscsum/vsolscsum.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?vsolscsum

Dataset: vsolscsum
Description: A Vietnamese dataset for social context summarization. The dataset contains 141 open-domain articles along with 3,760 sentences, 2,448 extracted standard sentences and comments used as standard summaries, and 6,926 comments in 12 events. The dataset was manually annotated by humans. Note that the extracted standard summaries also include comments.
Subsets: -
Languages: vie
Tasks: Summarization
License: Creative Commons Attribution 4.0 (cc-by-4.0)
Homepage: https://github.com/nguyenlab/VSoLSCSum-Dataset
HF URL: -
Paper URL: https://aclanthology.org/W16-5405/
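
For whoever picks this up, the fields above could be captured in code roughly as follows. This is only a minimal sketch: the `DatasetCard` class, its field names, and the `missing_fields` helper are hypothetical and are not part of the seacrowd-datahub API; the values are copied from the table in this issue.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetCard:
    """Hypothetical container for the metadata fields listed in this issue."""

    name: str
    description: str
    languages: tuple
    tasks: tuple
    license: str
    homepage: str
    paper_url: str
    subsets: tuple = ()  # the issue lists no subsets ("-")
    hf_url: str = ""     # the issue lists no HF URL ("-")

    def missing_fields(self):
        """Return the names of required fields that are empty."""
        required = ("name", "description", "languages", "tasks", "license", "homepage")
        return [f for f in required if not getattr(self, f)]


# Values taken verbatim from the issue table.
VSOLSCSUM = DatasetCard(
    name="vsolscsum",
    description=(
        "A Vietnamese dataset for social context summarization: "
        "141 open-domain articles, 3,760 sentences, 2,448 extracted standard "
        "sentences and comments as standard summaries, and 6,926 comments in 12 events."
    ),
    languages=("vie",),
    tasks=("Summarization",),
    license="cc-by-4.0",
    homepage="https://github.com/nguyenlab/VSoLSCSum-Dataset",
    paper_url="https://aclanthology.org/W16-5405/",
)

if __name__ == "__main__":
    # Prints an empty list when every required field is filled in.
    print(VSOLSCSUM.missing_fields())
```

A quick check like `missing_fields()` may help catch an empty field before the real loader is written against the project's own template.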

muhammadravi251001 commented 3 months ago

self-assign