Closed sdx0112 closed 5 years ago
If I am not taking it wrongly, your instruction is to adapt to VCTK speakers only. My question is to adapt to any arbitrary new speaker.
Not limited to VCTK. You'd need to write some code depending on your dataset though. See ljspeech.py for example.
If I have some audio files from a new speaker, how can I adapt to this speaker using pre-trained model?