Hello, I'm trying to train deepvoice on russian single speaker dataset.
Hyperparameters are the same as nyanko_ljspeech.
After 660k steps predicted speech is terribly noisy and impossible to understand.
I guess I have to tweak some hyperparameters but I dont know what exactly.
Alignments of the 660k steps prediction:
Hello, I'm trying to train deepvoice on russian single speaker dataset. Hyperparameters are the same as nyanko_ljspeech. After 660k steps predicted speech is terribly noisy and impossible to understand. I guess I have to tweak some hyperparameters but I dont know what exactly. Alignments of the 660k steps prediction: