I've retrained the text2mel model, by cutting out mel reduction part in preprocessor, and changing the hparams to:
hop_length = 256win_length = 1024max_N = 180 # Maximum number of characters.max_T = 210 # Maximum number of mel frames.e = 512 # embedding dimensiond = 256 # Text2Mel hidden unit dimension
I'm trying to feed generated mels to MelGan, but output audio file is just noisy honk.
Any ideas?
Hey!
I've retrained the text2mel model, by cutting out mel reduction part in preprocessor, and changing the hparams to:
hop_length = 256
win_length = 1024
max_N = 180 # Maximum number of characters.
max_T = 210 # Maximum number of mel frames.
e = 512 # embedding dimension
d = 256 # Text2Mel hidden unit dimension
I'm trying to feed generated mels to MelGan, but output audio file is just noisy honk. Any ideas?