the model is hard to converge with LJSpeech

syang1993 / gst-tacotron

A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"


Hi! Thanks for your contribution! I trained the model on the LJSpeech dataset with your code, but the loss does not converge with your default hparams. Here are some results from TensorBoard. Could you give me some advice?

batch_size=32 lr=0.002
batch_size=32 lr=0.001
batch_size=64 lr=0.001
batch_size=64 lr=0.0006
batch_size=32 lr=0.0001
batch_size=32 lr=0.00002
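
For easier comparison, the six runs listed above can be written down as a small sketch. The values are copied from the captions; the key names `batch_size` and `lr` are only the shorthand used in this post, and the helper `format_overrides` is hypothetical. Whether the repository's hparams use these exact names is an assumption.

```python
# Runs tried so far, in the order listed above (values copied from the captions).
# Key names are this post's shorthand, not necessarily the repo's hparam names.
RUNS = [
    {"batch_size": 32, "lr": 0.002},
    {"batch_size": 32, "lr": 0.001},
    {"batch_size": 64, "lr": 0.001},
    {"batch_size": 64, "lr": 0.0006},
    {"batch_size": 32, "lr": 0.0001},
    {"batch_size": 32, "lr": 0.00002},
]


def format_overrides(run):
    """Render one run as a comma-separated 'name=value' string (hypothetical helper)."""
    return ",".join(f"{name}={value}" for name, value in run.items())


if __name__ == "__main__":
    # Print one line per run so the settings can be pasted next to each plot.
    for run in RUNS:
        print(format_overrides(run))
```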

Finally, the model seems to converge, but the alignment is not good. The step-51000-align.png looks like this. Should I keep training, or kill this process and try other hparams? Can you give me some advice?

the model is hard to converge with LJSpeech #18