Artur_B_Studio (inside the dataset) contains 50 hours of a single speaker recorded in a studio (high quality). In total there are 800 transcribed hours (multiple speakers, varying quality)
for phonemizer you can use espeak-ng with "sl" language ("slovenian" voice)
Hi! Could you share your experience how to create model for new language, please?
It would be so helpful, I want to create model for Greek language and your advices can help me
🚀 Feature
Please add support for slovenian language, here you can find a quality dataset:
Artur_B_Studio (inside the dataset) contains 50 hours of a single speaker recorded in a studio (high quality). In total there are 800 transcribed hours (multiple speakers, varying quality)
for phonemizer you can use espeak-ng with "sl" language ("slovenian" voice)