IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
260 stars 61 forks

Create dataset loader for NusaKalimat #346

Closed. SamuelCahyawijaya closed this issue 1 year ago

SamuelCahyawijaya commented 1 year ago

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?nusa_kalimat

Dataset: nusa_kalimat
Description: NusaKalimat is a sentence-level machine translation dataset that covers 11 local languages in Indonesia.
License: CC-BY-NC-SA 4.0

SamuelCahyawijaya commented 1 year ago

We need to restructure the entry in the NusaCatalogue for this dataset. Closing this issue for now.