The process of converting spectrograms into audio is very slow

auspicious3000 / autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

https://arxiv.org/abs/1905.05879

MIT License

987 stars, 205 forks

The process of converting spectrograms into audio is very slow #125

Closed: idaoi closed this issue 1 month ago

idaoi commented 1 month ago

Converting the spectrogram output by the model to audio with wavegen is very slow. In the vocoder.ipynb example, each audio segment takes 20 minutes to convert. Is this normal? My graphics card is an RTX 2060 with CUDA.
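For anyone wanting to put the 20-minute figure in context, here is a rough back-of-the-envelope sketch. It assumes that wavegen generates audio one sample at a time (as autoregressive WaveNet-style vocoders typically do), and that the spectrograms use a hop length of 256 samples at a 16 kHz sample rate. Those values, the function names, and the 128-frame segment length are assumptions for illustration, not figures confirmed in this issue.

```python
# Rough timing arithmetic for sample-by-sample waveform generation.
# ASSUMPTIONS (not confirmed in this issue): hop_length=256, sample_rate=16000,
# and one sequential model step per generated audio sample.


def sequential_steps(n_frames: int, hop_length: int = 256) -> int:
    """Number of audio samples, i.e. sequential generation steps, for a spectrogram."""
    return n_frames * hop_length


def samples_per_second(n_frames: int, wall_seconds: float, hop_length: int = 256) -> float:
    """Generation throughput in audio samples per wall-clock second."""
    return sequential_steps(n_frames, hop_length) / wall_seconds


def real_time_factor(
    n_frames: int,
    wall_seconds: float,
    hop_length: int = 256,
    sample_rate: int = 16000,
) -> float:
    """Wall-clock seconds spent per second of generated audio."""
    audio_seconds = sequential_steps(n_frames, hop_length) / sample_rate
    return wall_seconds / audio_seconds


if __name__ == "__main__":
    n_frames = 128           # hypothetical segment length in spectrogram frames
    wall_seconds = 20 * 60   # the 20 minutes reported above
    print("sequential steps:", sequential_steps(n_frames))
    print("samples per second:", samples_per_second(n_frames, wall_seconds))
    print("real-time factor:", real_time_factor(n_frames, wall_seconds))
```

Plugging in the actual frame count of a segment gives the throughput in samples per second. Under these assumptions, a very low number would suggest that generation is not running on the GPU at all, while a moderate number would point to the sequential, per-sample nature of the generation itself.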