syang1993 / gst-tacotron

A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
368 stars 110 forks source link

Unable to reproduce results #44

Open Anchit1999 opened 3 years ago

Anchit1999 commented 3 years ago

Hi, I am using this exact code with same hyperparameters but the results produces are no way close to the sample results shown in this repository. I have tried training using both LJSpeech and Blizzard dataset. Output from both the models have some noise present in it. What could be the possible reason?