SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
65 stars 57 forks source link

Closes #351 | Implemented dataloader for indoler #378

Closed luckysusanto closed 7 months ago

luckysusanto commented 8 months ago

Closes #351

Note: Despite only having 993 data instances, there are almost 5.9M tokens that need to be processed. the testcase ran successfully, but took my laptop 150s. test case If anyone knows how I can optimize this, please do tell me.

Checkbox

luckysusanto commented 8 months ago

Requesting review! @holylovenia