The APNet2 paper reports that Vocos was highly sensitive to the frequency band range of its input features: when using 100-dimensional full-band log-mel-spectrograms as input, Vocos exhibited a significant improvement.

Reference: APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra (https://arxiv.org/pdf/2311.11545.pdf)
![vocos](https://github.com/wenet-e2e/wetts/assets/49022799/4687d2d9-7e27-4a0b-b323-7724d6ea6836)
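
To make "100-dimensional full-band log-mel-spectrogram" concrete, here is a minimal NumPy sketch of such a feature extractor. It is not taken from Vocos, APNet2, or wetts. The sample rate (24 kHz), FFT size (1024), hop size (256), HTK-style mel formula, and log floor (1e-5) are all assumptions chosen for illustration, and the function names are hypothetical. "Full-band" is read here as the mel filters covering 0 Hz up to the Nyquist frequency (`sr / 2`).

```python
import numpy as np


def hz_to_mel(f):
    # HTK-style mel scale (an assumption; other mel variants exist).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=np.float64) / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=np.float64) / 2595.0) - 1.0)


def mel_filterbank(sr, n_fft, n_mels, fmin, fmax):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(fmin), hz_to_mel(fmax), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        lo, ctr, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (fft_freqs - lo) / (ctr - lo)
        falling = (hi - fft_freqs) / (hi - ctr)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb


def log_mel_spectrogram(wav, sr=24000, n_fft=1024, hop=256,
                        n_mels=100, fmin=0.0, fmax=None):
    """Return a (n_mels, n_frames) log-mel spectrogram.

    With fmax=None the filters span 0 Hz to sr/2, i.e. the full band.
    All default values are illustrative assumptions.
    """
    fmax = sr / 2.0 if fmax is None else fmax
    window = np.hanning(n_fft)
    # Reflect-pad so that frames are centered on multiples of the hop size.
    padded = np.pad(wav, (n_fft // 2, n_fft // 2), mode="reflect")
    n_frames = 1 + (len(padded) - n_fft) // hop
    frames = np.stack([padded[i * hop: i * hop + n_fft] * window
                       for i in range(n_frames)])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))      # (n_frames, n_fft//2 + 1)
    mel = magnitude @ mel_filterbank(sr, n_fft, n_mels, fmin, fmax).T
    return np.log(np.clip(mel, 1e-5, None)).T            # (n_mels, n_frames)


if __name__ == "__main__":
    sr = 24000
    t = np.arange(sr) / sr
    wav = 0.5 * np.sin(2 * np.pi * 440.0 * t)            # 1 second test tone
    feats = log_mel_spectrogram(wav, sr=sr)
    print(feats.shape)
```

Passing a smaller `fmax` (for example 8000.0) to the same function gives a band-limited feature, which is the kind of input range difference the quoted finding is about.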