myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.
https://research.myshell.ai/open-voice
MIT License
29.97k stars 2.96k forks

Some questions? #29

Open GuangChen2016 opened 11 months ago

GuangChen2016 commented 11 months ago

Hi, I have some questions:

  1. Do the speaker encoders of the base TTS models and the tone color converter model share the same model structure? Is there any connection between the base TTS models and the tone color converter model?
  2. During training, for a text-audio pair <x, y>, do the reference speaker audio, the output of the tone color converter model (speech with the reference tone color and controlled styles), and the g used in both the flow and the reverse flow all come from y?
  3. Do you plan to release the training code? We still have not been able to train a good model by following your paper. Thanks a lot.

abhishekmaroon5 commented 11 months ago

Are you guys planning to release the code?