Sueoka-ppc closed this issue 2 years ago
Please check the shape. It might be transposed. https://github.com/kan-bayashi/ParallelWaveGAN/blob/e027f53ee7c5dc813d61cf3a47749a6e2abc9369/parallel_wavegan/models/hifigan.py#L251-L260
Sorry, I am a beginner with tensors and I don't know how to fix this.
Should I change the shape, or is this spectrogram not suitable for this model?
Please check the shape of inputs. Maybe your input shape is (#mels, #frames) or (#batch, #frames, #mels) but my implementation assumes (#frames, #mels). Please modify the shape by yourself.
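A minimal sketch of the shape check described above, assuming the spectrogram is a 2-D NumPy array with 80 mel bins (the value that appears in the error messages in this thread). `to_frames_by_mels` is a hypothetical helper, not part of ParallelWaveGAN.

```python
import numpy as np

N_MELS = 80  # assumption: number of mel bins the vocoder expects


def to_frames_by_mels(spec, n_mels=N_MELS):
    """Return a 2-D spectrogram laid out as (#frames, #mels)."""
    if spec.ndim != 2:
        raise ValueError(f"expected a 2-D spectrogram, got shape {spec.shape}")
    if spec.shape[1] == n_mels:
        return spec  # already (#frames, #mels)
    if spec.shape[0] == n_mels:
        return spec.T  # (#mels, #frames) -> (#frames, #mels)
    raise ValueError(f"no axis of size {n_mels} in shape {spec.shape}")


mel = np.zeros((80, 226), dtype=np.float32)  # dummy (#mels, #frames) input
fixed = to_frames_by_mels(mel)
print(fixed.shape)  # (226, 80)
```

If the number of frames happens to equal the number of mel bins, the layout is ambiguous and this helper returns the input unchanged.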
I tried to reshape it, but a runtime error occurred:
RuntimeError: Expected 3-dimensional input for 3-dimensional weight [512, 80, 7], but got 4-dimensional input of size [1, 80, 226, 1] instead
Can anyone help me?
You are using the wrong shape. Please check the docstring carefully. The input is 2-D (#frames, #mels): https://github.com/kan-bayashi/ParallelWaveGAN/blob/e027f53ee7c5dc813d61cf3a47749a6e2abc9369/parallel_wavegan/models/hifigan.py#L251-L260
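As a rough illustration, a 4-D array of size [1, 80, 226, 1] like the one in the error message can be brought to the 2-D (#frames, #mels) layout by dropping the size-1 axes and then swapping the remaining two. This sketch uses a dummy NumPy array and assumes the 80-sized axis holds the mel bins; the same idea applies to a PyTorch tensor.

```python
import numpy as np

# Dummy array with the 4-D shape from the error message: [1, 80, 226, 1].
x = np.arange(80 * 226, dtype=np.float32).reshape(1, 80, 226, 1)

mel = np.squeeze(x)  # drop the size-1 axes -> (80, 226), i.e. (#mels, #frames)
mel = mel.T          # swap the axes       -> (226, 80), i.e. (#frames, #mels)
print(mel.shape)     # (226, 80)

# Caution: reshape only reinterprets the same memory order under a new shape,
# so it mixes values from different frames instead of swapping the axes.
wrong = np.squeeze(x).reshape(226, 80)
print(np.array_equal(wrong, mel))  # False
```

Using `reshape` where a transpose is needed yields the right shape with scrambled content, which is one possible cause of distorted audio.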
The reshape succeeded, but the generated wav is broken. I think there is a mismatch between the spectrogram generator and the vocoder.
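One way to look for such a mismatch is to compare the feature-extraction settings of the two models side by side. The sketch below is only illustrative: the dictionary keys and all numbers are placeholders, not the real settings of either model, and `find_mismatches` is a hypothetical helper. The actual values would have to come from each model's own config.

```python
# Placeholder feature settings; replace them with the values from each model's config.
tts_features = {"sampling_rate": 22050, "num_mels": 80, "fft_size": 1024, "hop_size": 256}
vocoder_features = {"sampling_rate": 24000, "num_mels": 80, "fft_size": 2048, "hop_size": 300}


def find_mismatches(a, b):
    """Return {key: (a_value, b_value)} for every setting that differs."""
    keys = sorted(set(a) | set(b))
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}


mismatches = find_mismatches(tts_features, vocoder_features)
for key, (tts_value, voc_value) in mismatches.items():
    print(f"{key}: TTS={tts_value}, vocoder={voc_value}")
```

Even when every listed setting agrees, the two models may still normalize or scale the mel values differently, so matching settings alone do not guarantee compatible features.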
Thanks for your support.
I am trying to run inference on a spectrogram generated by a NeMo glow_tts model (the vocoder is one of the pre-trained models jsut_multi_band_melgan.v2 or jsut_hifigan.v1).
But the error below occurred: RuntimeError: The size of tensor a (226) must match the size of tensor b (80) at non-singleton dimension 2
Can anyone help me?
my code