thammegowda / mtdata

A tool that locates, downloads, and extracts machine translation corpora
https://pypi.org/project/mtdata/
Apache License 2.0

Lanfrica #106

Open kpu opened 2 years ago

kpu commented 2 years ago

From the Masakhane Slack: there's a new website by @chrisemezue et al., https://lanfrica.com/, that links to African language resources. Some of these are parallel corpora. Of those, some are duplicates of sources already covered (e.g. OPUS), but there are definitely resources, such as https://zenodo.org/record/5089560, that are currently not known to mtdata.