Closed: 2Bye closed this issue 2 years ago
Hi, I haven't tested using HiFi-GAN to convert the mel-spectrograms generated by the released VQMIVC models. You may need to check whether the mel-spectrogram extraction you used to train HiFi-GAN is the same as mine; if it is not, the waveform generated by your HiFi-GAN will be problematic.
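One way to run the check suggested above is to put the two sets of mel-extraction settings side by side and list every field that differs. The sketch below is only illustrative: `diff_mel_configs` is a hypothetical helper, and the numbers in both dictionaries are placeholders, not values taken from VQMIVC or from any HiFi-GAN config. Read the real settings from each project's own configuration before drawing conclusions.

```python
def diff_mel_configs(feature_cfg, vocoder_cfg):
    """Return {setting: (feature_value, vocoder_value)} for every setting that differs.

    A setting missing from one side is reported with None on that side.
    """
    keys = sorted(set(feature_cfg) | set(vocoder_cfg))
    return {
        key: (feature_cfg.get(key), vocoder_cfg.get(key))
        for key in keys
        if feature_cfg.get(key) != vocoder_cfg.get(key)
    }


# PLACEHOLDER values for illustration only (assumptions, not real project settings).
acoustic_model_cfg = {
    "sample_rate": 16000,
    "n_fft": 400,
    "hop_length": 160,
    "n_mels": 80,
    "log_base": "10",
}
vocoder_training_cfg = {
    "sample_rate": 22050,
    "n_fft": 1024,
    "hop_length": 256,
    "n_mels": 80,
    "log_base": "e",
}

mismatches = diff_mel_configs(acoustic_model_cfg, vocoder_training_cfg)
for name, (feature_value, vocoder_value) in mismatches.items():
    print(f"{name}: acoustic model uses {feature_value}, vocoder expects {vocoder_value}")
```

Matching the shape (for example 80 mel bins) is not enough on its own: a different sample rate, hop length, or log base changes the values inside the spectrogram, which a vocoder trained on other features may not decode cleanly.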
@2Bye Hello, I ran into this problem too. Did you end up using HiFi-GAN? I trained on a Chinese dataset.
Hello, I tried to use another vocoder, HiFi-GAN, with your model, but the output is either noisy audio or silence.
I transposed the log-mel output into the regular HiFi-GAN input shape [1, 80, X]. With int16 I get very noisy audio; with int32 I get silence.
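The int16 versus int32 difference may be partly a scaling effect rather than a model problem. The sketch below assumes the vocoder returns float samples in [-1.0, 1.0] (an assumption, since that depends on the vocoder implementation) and shows that the scale factor has to match the integer width. `float_to_pcm` is a hypothetical helper written for this illustration.

```python
def float_to_pcm(samples, bits=16):
    """Clip floats to [-1.0, 1.0] and scale them to signed integers of the given width."""
    full_scale = 2 ** (bits - 1) - 1  # 32767 for 16-bit, 2147483647 for 32-bit
    return [int(round(max(-1.0, min(1.0, s)) * full_scale)) for s in samples]


waveform = [0.0, 1.0, -1.0, 2.0]  # toy samples; 2.0 is out of range and gets clipped

pcm16 = float_to_pcm(waveform, bits=16)
pcm32 = float_to_pcm(waveform, bits=32)

# If samples scaled for 16-bit are stored in a 32-bit container, the peak sits at
# only a tiny fraction of full scale, which is effectively inaudible.
peak_ratio = (2 ** 15 - 1) / (2 ** 31 - 1)

# Casting unscaled floats straight to integers truncates almost everything to zero.
unscaled_cast = [int(s) for s in [0.3, -0.7, 0.99]]
```

If the scaling is already correct and the int16 output is still noisy, the cause is more likely a mismatch between the mel features the vocoder was trained on and the ones it receives.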
My inference code: