facebookresearch / flores

Facebook Low Resource (FLoRes) MT Benchmark

how to get the monolingual corpus #4

Closed by cocaer 5 years ago

cocaer commented 5 years ago

The code seems to be written for downloading the parallel corpus. If I want to work on unsupervised NMT (UNMT), how can I get the monolingual Common Crawl corpus referred to in the paper? The Nepali and Sinhala corpora are not available on the paracrawl.eu homepage.