myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.
https://research.myshell.ai/open-voice
MIT License
28.08k stars 2.75k forks source link

some qustions? #29

Open GuangChen2016 opened 7 months ago

GuangChen2016 commented 7 months ago

Hi, I have some questions as belows:

  1. Are the speaker encoder models of the base tts models and tone color converter model be the same model structure? Is there any connection between base tts models and tone color converter model?
  2. During training, for text-audio pair <x, y>, are the reference speaker audio, the output of tone color converter model (speech with reference tone color and controlled styles) and g from both flow and reverse flow all from y?
  3. Would you plan to release the codes of the training parts, we still could not train a good model following your paper. Thanks a lot
abhishekmaroon5 commented 7 months ago

are you guys planning to release its code