SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
64 stars 57 forks source link

Create dataset loader for MALINDO_Morph #193

Closed SamuelCahyawijaya closed 6 months ago

SamuelCahyawijaya commented 9 months ago

Dataloader name: malindo_morph/malindo_morph.py DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?malindo_morph

Dataset malindo_morph
Description MALINDO Morph merupakan kamus morfologi untuk bahasa Melayu dan bahasa Indonesia. Kamus MALINDO Morph dilesenkan dengan pelesenan Creative Commons Attribution 4.0 International (CC BY 4.0). Untuk maklumat terperinci mengenai MALINDO Morph, sila rujuk makalah di bawah ini.
Subsets Kamus Dewan, Kamus Besar Bahasa Indonesia, Leipzig Corpora Collection, Frogstory-David, Melayu-Standard-Lisan, Melayu-Sabah, Melayu-Sarawak, Melayu-Brunei, Indo-Jakarta-Lisan
Languages zlm, ind
Tasks Morphological Inflections
License Creative Commons Attribution 4.0 (cc-by-4.0)
Homepage https://github.com/matbahasa/MALINDO_Morph
HF URL -
Paper URL http://lrec-conf.org/workshops/lrec2018/W29/pdf/8_W29.pdf
MJonibek commented 9 months ago

self-assign

danjohnvelasco commented 9 months ago

self-assign

github-actions[bot] commented 8 months ago

Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.