thammegowda / mtdata

A tool that locates, downloads, and extracts machine translation corpora
https://pypi.org/project/mtdata/
Apache License 2.0

AllenAI NLLB dataset (excluding CCMatrix) #134

Closed AlexUmnov closed 2 years ago

AlexUmnov commented 2 years ago

Adding the dataset requested in issue https://github.com/thammegowda/mtdata/issues/133

thammegowda commented 2 years ago

Thanks @AlexUmnov! I will merge it once I get a chance to test it.