MycroftAI / mimic2

Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Apache License 2.0
581 stars 103 forks source link

Model close to getting aligned, but still jumbled #33

Open Scrollkeeper opened 5 years ago

Scrollkeeper commented 5 years ago

Hi again, I've trained the model up to 89,000 steps. It is close to getting aligned, but is still not quite there. The problem is that it has been pretty much the same since 38,000 steps. Synthesis sounds flawless when exporting from train.py but is very poor from eval.py Some examples:

step-30000-align step-38000-align step-74000-align step-83000-align

It could just be that the dataset isn't cohesive enough, but I'm not certain.
Thank you for your time and input! :)

Scrollkeeper commented 5 years ago

Some more graphs: step-26000-align step-28000-align step-36000-align step-43500-align step-50500-align step-53500-align step-66500-align