CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time
Other
52.05k stars 8.71k forks source link

How to make the perfect training data set? #1105

Open CodingRox82 opened 2 years ago

CodingRox82 commented 2 years ago

I want to use this repo to clone my voice. How do I create the perfect training data set? Here are some ideas I have:

-Read a set of words that contains every letter in the alphabet -Read a set of words in a normal tone, then read those same words with more energy/power, then read the same with more inflection, then sarcasm, then with inquisitiveness, etc. -Read continuously with no long pauses -Read words for a minimum of 3 minutes or X number of words, where X is some relatively large number -Minimize external noise, so maybe reserve some time in a professional audio production studio

@CorentinJ any tips here on how to make the best possible training data set?