SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
Closes #590 | Add Dataloader Thai Elderly Speech #656

akhdanfadh closed 1 month ago

akhdanfadh commented 2 months ago

Closes #590

I implemented one config per subset, so config names look like thai_elderly_speech_healthcare_source, thai_elderly_speech_smarthome_seacrowd_sptext, and so on. When testing, pass thai_elderly_speech_<subset> to the --subset_id parameter.
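To illustrate the naming scheme described above, here is a minimal sketch of how one config per subset and schema could be enumerated. It is not the code from this PR: `LoaderConfig` and `build_configs` are hypothetical stand-ins rather than SEACrowd classes, and the subset list contains only the two subsets named in the description.

```python
from dataclasses import dataclass
from itertools import product

DATASET_NAME = "thai_elderly_speech"
# Assumption: only the two subsets mentioned in the PR description are listed.
SUBSETS = ["healthcare", "smarthome"]
# The two schema suffixes that appear in the example config names.
SCHEMAS = ["source", "seacrowd_sptext"]


@dataclass(frozen=True)
class LoaderConfig:
    """Hypothetical stand-in for a dataloader config (not the SEACrowd class)."""

    name: str       # full config name, e.g. thai_elderly_speech_healthcare_source
    subset_id: str  # value to pass to --subset_id when testing
    schema: str     # schema suffix


def build_configs():
    """Return one config per (subset, schema) pair."""
    return [
        LoaderConfig(
            name=f"{DATASET_NAME}_{subset}_{schema}",
            subset_id=f"{DATASET_NAME}_{subset}",
            schema=schema,
        )
        for subset, schema in product(SUBSETS, SCHEMAS)
    ]


CONFIGS = build_configs()

for config in CONFIGS:
    print(f"{config.name} -> --subset_id {config.subset_id}")
```

Under these assumptions, every config name is the subset ID plus a schema suffix, which is why the test parameter takes the form thai_elderly_speech_<subset>.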

holylovenia commented 1 month ago

Hi @patrickamadeus, I would like to let you know that we plan to finalize the calculation of the open contributions (e.g., dataloader implementations) in 31 hours, so it'd be great if we could wrap up the reviewing and merge this PR before then.

cc: @akhdanfadh

fhudi commented 1 month ago

cc: @sabilmakbar