After some surveying in the previous ticket #8 , coqui xTTS model looks the most promising direction to go for Taiwanese accent downloading, since it supports multi-languages, it is open sourced that allows for commercial use, and it has good performance quality than Bark, and MS-TTS model.
Goal
trained a fine-tuned xTTS model on Taiwanese dataset and review the result of synthesized output audios.
Existing Status & Motivation
After some surveying in the previous ticket #8 , coqui xTTS model looks the most promising direction to go for Taiwanese accent downloading, since it supports multi-languages, it is open sourced that allows for commercial use, and it has good performance quality than Bark, and MS-TTS model.
Goal
trained a fine-tuned xTTS model on Taiwanese dataset and review the result of synthesized output audios.
References