I have been testing a model with my voice. The voice is very clear and has the same properties as the example voice in spanish that works
The problem is that after 100000 steps, it does not vocalize and is impossible to understand any word. On the other hand, the evaluation wavs sound really well.
What I need to change? As I said, I have the same parameters as a voice that works.
Hello!!
I have been testing a model with my voice. The voice is very clear and has the same properties as the example voice in spanish that works
The problem is that after 100000 steps, it does not vocalize and is impossible to understand any word. On the other hand, the evaluation wavs sound really well.
What I need to change? As I said, I have the same parameters as a voice that works.
NOTE: My dataset has around 8 hours of audios
Thank you!!!
There are some aligment outputs:
================= 0 ===================
![linear-batch_0_sentence_0](https://user-images.githubusercontent.com/50680821/60080602-878e8700-9730-11e9-996e-363fe9b7a705.png)
================= 1 ===================
![linear-batch_1_sentence_0](https://user-images.githubusercontent.com/50680821/60080608-8b220e00-9730-11e9-8fc0-f48dfc64ced8.png)
================= 2 ===================
![linear-batch_2_sentence_0](https://user-images.githubusercontent.com/50680821/60080618-8e1cfe80-9730-11e9-9fce-a4d480a82f63.png)
================= 3 ===================
![linear-batch_3_sentence_0](https://user-images.githubusercontent.com/50680821/60080630-937a4900-9730-11e9-8136-83f7f4834863.png)
================= 4 ===================
![linear-batch_4_sentence_0](https://user-images.githubusercontent.com/50680821/60080638-970dd000-9730-11e9-9130-d460781f3311.png)