SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
65 stars 57 forks source link

Create dataset loader for IndoQA #430

Closed SamuelCahyawijaya closed 7 months ago

SamuelCahyawijaya commented 8 months ago

Dataloader name: indoqa/indoqa.py DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?indoqa

Dataset indoqa
Description IndoQA is an Indonesian question-answering dataset. It is comprised of ~4.5k examples. The datasets consists of a context paragraph along with an associated question-answer pair.
Subsets -
Languages ind
Tasks Question Answering
License Creative Commons Attribution No Derivatives 4.0 (cc-by-nd-4.0)
Homepage https://huggingface.co/datasets/jakartaresearch/indoqa
HF URL https://huggingface.co/datasets/jakartaresearch/indoqa
Paper URL https://huggingface.co/datasets/jakartaresearch/indoqa
fhudi commented 8 months ago

self-assign

github-actions[bot] commented 7 months ago

Hi @, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.