Closed ioneuk closed 2 years ago
Hey there.
Hi @neonbjb,
First off, thank you for the incredible work on this project! I have a quick question regarding the Mel Spectrograms used in the discrete autoencoder and diffusion training. Could you please clarify what type of normalization is applied to the Mel Spectrograms during these stages?
Thanks in advance!
Hello. Your work is absolutely great. One of the best TTS that I've ever seen. I am trying to understand your work conceptually. My questions are related to VQ-VAE pretraining on the speech data: