JarodMica / ai-voice-cloning

GNU General Public License v3.0
654 stars 144 forks source link

Wrong accent when using other languages #152

Open blastbeng opened 3 months ago

blastbeng commented 3 months ago

Hi, I am trying to clone my voice (ITALIAN) starting from a recording of myself, but I have some problems:

  1. How big must be the audio files to be used for training? Is a 2 minutes audio ok?
  2. I successfully trained my voice, but when i generate the audio He speaks incomprensibile words with an "english accent" and not italian accent, but i have the it_tokenizer in the config.... what could be the problem?
  3. What is an "optimal" training configuration? On the video guide you do something based on 20 epochs, is that enough? (I also did with 20)

Thanks