Is it possible to change the voice of the TTS model i.e. adapt/finetune it to a new dataset with a different voice such as in this paper: Cascading ASR + TTS. From what I read in the documentation and samples, its possible to do style transfer, but didn't find anything on voice conversion.
Hi team,
Is it possible to change the voice of the TTS model i.e. adapt/finetune it to a new dataset with a different voice such as in this paper: Cascading ASR + TTS. From what I read in the documentation and samples, its possible to do style transfer, but didn't find anything on voice conversion.