This implementation should work on any datasets stored similarly to VCTK, it's just a manner of finding new data.
For TTS, you also need to implement local conditioning and find a tool that can provide local speech features for your language of choice.
This implementation should work on any datasets stored similarly to VCTK, it's just a manner of finding new data. For TTS, you also need to implement local conditioning and find a tool that can provide local speech features for your language of choice.