nltk / nltk_data

NLTK Data

Tatoeba corpora #122

Open MohammedBelkacem opened 5 years ago

MohammedBelkacem commented 5 years ago

Tatoeba is a collection of several corpora: text, audio, and translations. It is released under a public licence. Could we add it to the NLTK corpora data download? By the way, I'm interested in the Kabyle corpora, as I'm using NLTK for some processing tasks.
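
To make the request more concrete, here is a rough sketch of how the Kabyle sentences could be pulled out of a Tatoeba export by hand. It assumes the sentences export is a tab-separated file with three columns (sentence id, language code, text) and that Kabyle uses the ISO 639-3 code `kab`. The function name `read_tatoeba_sentences` and the sample rows are made up for illustration and are not part of NLTK or Tatoeba.

```python
import csv
import io


def read_tatoeba_sentences(handle, lang):
    """Yield (sentence_id, text) for rows whose language code equals `lang`.

    Assumes each row is: id <TAB> language code <TAB> sentence text.
    """
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 3:
            continue  # skip blank or malformed rows
        sentence_id, code, text = row[0], row[1], row[2]
        if code == lang:
            yield int(sentence_id), text


# Illustrative in-memory sample standing in for the real export file.
sample = io.StringIO("1\teng\tHello.\n2\tkab\tAzul.\n3\tkab\tTanemmirt.\n")
kabyle = list(read_tatoeba_sentences(sample, "kab"))
print(kabyle)
```

With a real export, the same function could be given a file opened with `open(path, encoding="utf-8", newline="")`, and the resulting sentences passed on to whatever processing is done with NLTK.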