NVIDIA / flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
https://nv-adlr.github.io/Flowtron
Apache License 2.0

Emotion Transfer #131

Open letrongan opened 2 years ago

letrongan commented 2 years ago

Hi everyone, I'm new to the group. I'm glad to have read the detailed instructions in the README and the previous discussions. I completed the voice training in 3 steps:

The results are quite good: the speech is easy to understand (for me and some of my friends). I'm now working on emotion transfer (style_transfer), but the quality is very poor (the audio doesn't sound natural and doesn't feel right). I have tried both the flowtron_ljs model for English and my own model for Vietnamese.

Can someone who has completed this process give me some suggestions? Thanks a lot.

@rafaelvalle Please help me and give me some advice.

letrongan commented 2 years ago

Please... help me.