myshell-ai / MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
MIT License

Is there a way or are there plans to introduce emotions? #187

Open juangea opened 1 month ago

juangea commented 1 month ago

Hi there.

I'm using Melo and the quality is very good; it works very well in Spanish. However, it's emotionless and feels dead. That's great for some uses, but for others it is very monotone. Is there a way to introduce emotion, or are there any plans to do so?

Thanks!

dezynetechnologies commented 1 day ago

I guess you can use tone coloring from OpenVoice. We have used MeloTTS to create AI-generated voices from a clear recording of a single speaker, around 40-45 minutes long. We trained for around 1000 epochs.
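
For anyone wondering what that could look like in practice, here is a rough sketch modeled on the OpenVoice V2 demo notebook. It assumes the OpenVoice V2 checkpoints have been downloaded into a local `checkpoints_v2` directory, and that both `melo` and `openvoice` are installed. The helper names `se_key` and `melo_with_tone_color` and all file paths are made up for illustration. It only shows the mechanics of running Melo output through the tone color converter; it does not by itself settle whether emotion carries over from the reference audio.

```python
def se_key(speaker_key: str) -> str:
    """Map a MeloTTS speaker key (e.g. 'EN_NEWEST') to the lowercase,
    hyphenated name the OpenVoice V2 demo uses for base-speaker files."""
    return speaker_key.lower().replace("_", "-")


def melo_with_tone_color(text, reference_audio, output_path,
                         language="ES", speaker_key="ES",
                         ckpt_dir="checkpoints_v2", speed=1.0):
    """Synthesize `text` with MeloTTS, then convert its tone color toward
    the voice in `reference_audio`. Paths and defaults are assumptions."""
    # Heavy dependencies are imported here so the helper above stays usable
    # even when melo / openvoice are not installed.
    import os
    import tempfile

    import torch
    from melo.api import TTS
    from openvoice import se_extractor
    from openvoice.api import ToneColorConverter

    device = "cuda:0" if torch.cuda.is_available() else "cpu"

    # Load the tone color converter (assumed checkpoint layout from the V2 demo).
    converter = ToneColorConverter(f"{ckpt_dir}/converter/config.json", device=device)
    converter.load_ckpt(f"{ckpt_dir}/converter/checkpoint.pth")

    # Tone color embedding of the reference (target) speaker.
    target_se, _ = se_extractor.get_se(reference_audio, converter)

    # Tone color embedding of the Melo base speaker that produces the source audio.
    source_se = torch.load(
        f"{ckpt_dir}/base_speakers/ses/{se_key(speaker_key)}.pth",
        map_location=device,
    )

    model = TTS(language=language, device=device)
    speaker_id = model.hps.data.spk2id[speaker_key]

    with tempfile.TemporaryDirectory() as tmp_dir:
        src_path = os.path.join(tmp_dir, "melo_source.wav")
        # Step 1: plain MeloTTS synthesis.
        model.tts_to_file(text, speaker_id, src_path, speed=speed)
        # Step 2: re-color the synthesized audio toward the reference voice.
        converter.convert(
            audio_src_path=src_path,
            src_se=source_se,
            tgt_se=target_se,
            output_path=output_path,
        )
    return output_path
```

A call would look something like `melo_with_tone_color("Hola, ¿qué tal?", "reference.mp3", "out.wav")`, where `reference.mp3` is a placeholder for your own recording.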

juangea commented 1 day ago

How could we use tone coloring?

Does the reference audio change the tone and the emotion? I thought it only tried to clone the voice, not the emotion.

For us, Melo gives good results in terms of clarity of speech. The problem is that the intonation is too emotionless and monotone, and that's what we are trying to solve.

Does the voice you trained sound more natural?