coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
MIT License
1.24k stars 133 forks source link

Mongolian 300 synthetic STT data + others #179

Open JRMeyer opened 2 years ago

JRMeyer commented 2 years ago

https://github.com/tugstugi/mongolian-nlp#datasets

JRMeyer commented 2 years ago

h/t @tugstugi :+1:

tugstugi commented 2 years ago

300 hours dataset is not Mongolian but Kalmyk (Western Mongolian language).

JRMeyer commented 2 years ago

thanks for correcting me, @tugstugi !