vlomme / Multi-Tacotron-Voice-Cloning

Phoneme multilingual(Russian-English) voice cloning based on
https://github.com/CorentinJ/Real-Time-Voice-Cloning
Other
379 stars 97 forks source link

Use of language embedding #27

Open astricks opened 2 years ago

astricks commented 2 years ago

Hi @vlomme,

Great work here, and thanks for open-sourcing it. I'm trying to understand how this works so that I can replicate it. I've gone through the code and don't see any language embedding, which I thought would be how you separate the speaker from the language.

Can you please explain how language-speaker independence is achieved?