Step-by-step adding foreign words to ViSV2TTS

v-nhandt21 / ViSV2TTS

Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS

38 stars 14 forks source link

Hi @drlor2k,

To handle Foreign languages, as my knowledge we have two main approaches:

Convert both Vietnamese and English to IPA or some unified grapheme, in viphoneme I use "eng_to_ipa" library, but I am not sure it would work in all cases, and I dont have detailed experiments on how much English data is enough!
The simpler approach is to use a dictionary in which each word is mapped to Vietnamese syllable pronunciation

p/s 2: maybe there are some multilingual end2end models that can speak all of this without phoneme control by using a unified tokenizer :))

v-nhandt21 / ViSV2TTS