Open kpu opened 3 years ago
It has been such a valuable resource! https://jw.org has 1000 languages https://glosbe.com has 6000 languages (most of them are dictionaries, but there are 2B+ sentence pairs) If these two allow us to crawl their site text for NLP research+apps ....
Do we have any idea if it will back online anytime soon?
sorry for replying to this extremely old thread, but i just heard of this dataset, i'm sad it's down. i wish there was a backup somewhere.
Something related to copyright, they are trying to get proper permission.