IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
262 stars 62 forks source link

Create dataset loader for MaRVL #165

Open SamuelCahyawijaya opened 2 years ago

SamuelCahyawijaya commented 2 years ago

https://indonlp.github.io/nusa-catalogue/card.html?marvl

wenliangdai commented 2 years ago

self-assign

bryanwilie commented 2 years ago

Hi @wenliangdai , are you still working on this? I will assume inactivity if there's no reply and will free the assignees. Thanks!

wenliangdai commented 2 years ago

@bryanwilie Hi Bryan, there is a license issue with this dataset. The official links for downloading data are temporarily created every time, we need re-upload them somewhere. Samuel is asking permission from the paper authors.

bryanwilie commented 2 years ago

Hi @wenliangdai. Noted on the issue. Thank you for the communication and let's wait on the permission then.

Thank you for joining us @wenliangdai!