SEACrowd / seacrowd-datahub

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Apache License 2.0
65 stars 57 forks source link

Ishan/utf shared: This is to update the Utf encoding for shared utilitiy #328

Closed ijindal closed 8 months ago

ijindal commented 9 months ago

This PR relates to https://github.com/SEACrowd/seacrowd-datahub/pull/247#discussion_r1440043322 and is linked to #247

Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.

Checkbox

sabilmakbar commented 9 months ago

Thanks for proposing changes in our shared utility. One note: do you mind splitting the code updates between the dataloader & config changes? thx!

ijindal commented 9 months ago

Sure. I will have separate these two efforts

ijindal commented 8 months ago

closing in lieu of #333