keithito / tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
MIT License
2.96k stars 957 forks source link

When I iterate 13,000 times, why is the synthesized speech a piece of silence #273

Open Text2-m opened 5 years ago

keithito commented 5 years ago

It's hard to say without more information, but 13k iterations is probably not enough.

vinnitu commented 5 years ago

how many iterations need?

vinnitu commented 5 years ago

It is normal? e78123b6-5aca-4233-9472-ded968904295.zip step-12000-audio.zip

japita-se commented 5 years ago

Me too I have 15k steps for Moilla Dataset. The attention plot seems good but the synthesis produces noise.

step-15000-align