I retrain the model in VCTK corpus following the training steps in your paper. However, I only get noise in my generated voice. I have a quesion in the file "metadata.pkl" in your project. Does the mel feature matrix in this file has already passed "Deconv Layer"?
I retrain the model in VCTK corpus following the training steps in your paper. However, I only get noise in my generated voice. I have a quesion in the file "metadata.pkl" in your project. Does the mel feature matrix in this file has already passed "Deconv Layer"?