IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
260 stars 62 forks source link

Closes #205 | TALPCo #237

Closed ryanignatius closed 2 years ago

ryanignatius commented 2 years ago

Please name your PR after the issue it closes. You can use the following line: "Closes #ISSUE-NUMBER" where you replace the ISSUE-NUMBER with the one corresponding to your dataset.

Checkbox

ryanignatius commented 2 years ago

Thank you for the dataloader, @ryanignatius!

I encountered an error when I tried to use the dataloader with configuration names using the format talpco_{src_lang}_{tgt_lang}_{schema}. Could you please modify line 133 to _, lang_source, lang_target = self.config.name.replace(f"_{self.config.schema}", "").split("_") to get rid of the error?

Thanks for the feedback! I have updated the code to fix the error as suggested.