Good luck.
https://github.com/r9y9/deepvoice3_pytorch uses LWS, and (this may be subjective) I think it gives better quality than GL.
They also use WORLD here, and their demos have good quality, but I can't find any repo that reproduces it: https://mtg.github.io/singing-synthesis-demos/
Here is also my test of the WORLD vocoder, i.e. what it would sound like if smoothed features were predicted: https://github.com/mozilla/TTS/issues/9#issuecomment-497303645
Hi, just wanted to know what's the state of the different vocoders. I have several questions:
From my tests, I've seen LWS is 10x faster than GL. Is this correct?
To implement LWS, would it be enough to compute the mel and linear spectrograms using LWS and also use it to invert them?
How would GL and LWS compare with WaveRNN in quality and inference time?
Is there any update on the WORLD vocoder?
Thanks a lot.
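For the GL side of the speed and quality comparison, a minimal NumPy sketch of Griffin-Lim can serve as a baseline. This is not the code used in this repo or in deepvoice3_pytorch. The function names (`stft`, `istft`, `griffin_lim`) and the defaults (`n_fft=512`, `hop=128`, 50 iterations) are assumptions chosen for illustration only.

```python
import numpy as np


def stft(x, n_fft=512, hop=128):
    """Hann-windowed STFT, shape (n_frames, n_fft // 2 + 1). No padding."""
    win = np.hanning(n_fft + 1)[:-1]  # periodic Hann window
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop:i * hop + n_fft] * win for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def istft(X, n_fft=512, hop=128):
    """Least-squares inverse STFT (windowed overlap-add, win**2 normalized)."""
    win = np.hanning(n_fft + 1)[:-1]
    n_frames = X.shape[0]
    length = n_fft + hop * (n_frames - 1)
    y = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(X, n=n_fft, axis=1)
    for i in range(n_frames):
        y[i * hop:i * hop + n_fft] += frames[i] * win
        norm[i * hop:i * hop + n_fft] += win ** 2
    return y / np.maximum(norm, 1e-8)


def griffin_lim(mag, n_iter=50, n_fft=512, hop=128, seed=0):
    """Estimate a waveform from a magnitude spectrogram.

    Starts from random phase, then alternates between resynthesis and
    re-analysis, keeping the target magnitude and the newly found phase.
    """
    rng = np.random.default_rng(seed)
    angles = np.exp(2j * np.pi * rng.random(mag.shape))
    for _ in range(n_iter):
        y = istft(mag * angles, n_fft, hop)
        rebuilt = stft(y, n_fft, hop)
        angles = rebuilt / np.maximum(np.abs(rebuilt), 1e-8)
    return istft(mag * angles, n_fft, hop)
```

To compare inference time, wrap the call to `griffin_lim` in `time.perf_counter()` and time the LWS inversion on the same magnitude spectrogram. The result depends heavily on the iteration count, so report it alongside any speed ratio.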