thammegowda / mtdata

A tool that locates, downloads, and extracts machine translation corpora
https://pypi.org/project/mtdata/
Apache License 2.0

AllenAi nllb dataset (excluding ccmatrix) #134

Closed by AlexUmnov 1 year ago

AlexUmnov commented 1 year ago

Adding the dataset for issue https://github.com/thammegowda/mtdata/issues/133.

thammegowda commented 1 year ago

Thanks @AlexUmnov! I will merge it once I get a chance to test it.