SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
63 stars 57 forks source link

Create dataset loader for TICO-19 #49

Closed SamuelCahyawijaya closed 6 months ago

SamuelCahyawijaya commented 10 months ago

Dataloader name: tico_19/tico_19.py DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?tico_19

Dataset tico_19
Description Translation Initiative for Covid-19 (TICO-19) have made data available to AI and MT researchers in 38 different languages in order to foster the development of tools and resources for improving access to information about Covid-19 in these languages. The benchmark includes 30 documents (3071 sentences, 69.7k words) translated from English into 37 languages, including 6 languages spoken in Southeast Asian regions: Indonesian, Khmer (Central), Malay, Myanmar, Tagalog, Tamil.
Subsets TICO-19 id, TICO-19 km, TICO-19 ms, TICO-19 my, TICO-19 tl, TICO-19 ta
Languages ind, khm, zlm, mya, tgl, tam
Tasks Machine Translation
License Creative Commons Zero v1.0 Universal (cc0-1.0)
Homepage https://tico-19.github.io/testset.html
HF URL -
Paper URL https://aclanthology.org/2020.nlpcovid19-2.5/
faridlazuarda commented 10 months ago

self-assign

sabilmakbar commented 9 months ago

Hi @faridlazuarda, may I know the current status of this dataloader creation? Feel free to discuss here if you have any difficulties. Thanks!

github-actions[bot] commented 9 months ago

Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.

sabilmakbar commented 9 months ago

Hi @faridlazuarda, may I know the progress on this? If there's no update from your end or comments on this issue (including a request for additional time to work on this) from you in the next 48hrs, we might consider removing this assignment from you so other contributors can work on this issue.

Again, if you find any difficulties in implementing this, pls let us know so we can help you in any way possible. Thanks!

sabilmakbar commented 9 months ago

Apologies for confusion, @faridlazuarda. You did mention that you're working on all issues assigned to you in this reply, but since you aren't replying on all issues assigned to you, we weren't able to track it. Therefore, I'm reassigning this to you.

cc @SamuelCahyawijaya @holylovenia

bryanwilie commented 8 months ago

self-assign

github-actions[bot] commented 7 months ago

Hi @${assignee}, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.

bryanwilie commented 7 months ago

Still trying to do something this weekend. Will notify further later!

bryanwilie commented 7 months ago

I don't think I could work on this eventually, sorry, I think I'll unassign myself for now.

ssun32 commented 7 months ago

self-assign

github-actions[bot] commented 7 months ago

Hi @, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.