SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
64 stars 57 forks source link

Closes #448 | Add/Update Dataloader alorese #541

Closed patrickamadeus closed 5 months ago

patrickamadeus commented 6 months ago

Closes #448

Checkbox

T2T

image

SPTEXT

image

SPTEXT_TRANS

image

sabilmakbar commented 5 months ago

wait I'm going to check it quickly, pardon for late response

sabilmakbar commented 5 months ago

Hi @patrickamadeus, I already put in an updated review. Let both of us know if the suggestion has been addressed, prob both me and LJ need to re-run the whole checking once more to ensure it's already correct since this data loader is quite complex. Thx!

sabilmakbar commented 5 months ago

Hi @patrickamadeus, all looks good to me. Since LJ said he doesn't have much PC storage left (presumably), I'll proceed with the merge :) (I am able to download all data & subsets and tested it too).

How does it sound, @ljvmiranda921? If that's fine from your end, I'll approve and merge it

ljvmiranda921 commented 5 months ago

^Yes please feel free to merge! 🙇