hdmjdp closed this issue 3 years ago
This is because of the length relationship between the input waveform and the mel-spectrum.
As shown above, the length of the input waveform (audio) equals the length of the mel-spectrum minus 4, multiplied by hop_length, i.e. (cond_length - 4) * hop_length.
A similar process can be found in Parallel WaveGAN.
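To make the arithmetic concrete, here is a minimal sketch of that length relationship. It is not taken from the LVCNet code: the function name expected_audio_length, the parameter context_frames, and the value hop_length = 256 are hypothetical. It also assumes that "minus 4" comes from 2 extra mel frames of context on each side of the segment.

```python
def expected_audio_length(cond_length: int, hop_length: int, context_frames: int = 2) -> int:
    """Return the number of waveform samples that match a mel segment.

    context_frames is the number of extra mel frames assumed on EACH side
    of the segment (an assumption for illustration), so 2 * context_frames
    frames are subtracted before multiplying by hop_length.
    """
    trimmed = cond_length - 2 * context_frames
    if trimmed <= 0:
        raise ValueError("cond_length must exceed 2 * context_frames")
    return trimmed * hop_length


hop_length = 256  # illustrative value, not read from any config

# Mel segment with 2 context frames on each side: (36 - 4) * 256 samples.
with_context = expected_audio_length(36, hop_length)

# Mel segment prepared without extra context frames: 32 * 256 samples.
without_context = expected_audio_length(32, hop_length, context_frames=0)

print(with_context, without_context)
```

If the mel-spectrum is prepared without extra context frames, the same check applies with context_frames=0, so nothing is subtracted.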
OK. In my data processing, I did not subtract 4, so in my version I think there is no need for "(cond_length - 4)".
https://github.com/ZENGZHEN-TTS/LVCNet/blob/6e7748e7ef358f7d95b8ce08bb682e7993f9c639/vocoder/models/lvcnet.py#L76
https://github.com/ZENGZHEN-TTS/LVCNet/blob/6e7748e7ef358f7d95b8ce08bb682e7993f9c639/vocoder/models/lvcnet.py#L45
if I pad 2