Sharing the results of the Korean model learned in my voice

jaywalnut310 / glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

MIT License

651 stars 150 forks source link

I used Glow-TTS and Multi-band MelGAN to create Korean TTS using my voice as a dataset. And the sample audio of the result can be found at the bottom of the Colab page.

Demo Colab

The dataset I used was only about 3 hours long, but I got really good results. I'm not very familiar with machine learning, but the results I could make using your code were really impressive. I was very pleased with the quality of the final result and the fast synthesis speed. It is especially good because it can be used even in a CPU environment.

Thanks to all of you for sharing such a great work 😄

jaywalnut310 / glow-tts

Sharing the results of the Korean model learned in my voice #31