SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0

Closes #474 | Add Dataloader OKAPI mARC #652

Closed by SamuelCahyawijaya 2 months ago

SamuelCahyawijaya commented 2 months ago

Closes #474

Checkbox

Test Indonesian subset: ./test_example.sh okapi_m_arc --subset_id okapi_m_arc_ind
Test Vietnamese subset: ./test_example.sh okapi_m_arc --subset_id okapi_m_arc_vie