Helsinki-NLP / Tatoeba-Challenge

Other
808 stars 91 forks source link

Indonesian missing? #5

Closed jvamvas closed 4 years ago

jvamvas commented 4 years ago

Thank you for providing this great resource!

It seems that Indonesian is not part of the Challenge right now. On Opus, there are 11.8M sentence pairs for Indonesian–English alone. Was Indonesian left out on purpose?

jorgtied commented 4 years ago

Indonesian is part of the 'msa' models as we use Macro-languages as specified in ISO639 standards.

jvamvas commented 4 years ago

Thanks!