syang1993 / gst-tacotron

A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"

Where do you insert or import the WAV file of the model's voice for training? #35

Open pnwseeker opened 5 years ago

pnwseeker commented 5 years ago

Hi, I have been looking at many different Tacotron repos and I get a bit confused, as I am very much a novice. All of the repos give the same instructions, but none of them say specifically where in the process I import the audio recording of my voice so that it can be trained into a new model. Where do I put the WAV file of my voice? How long should the sample be? Where in the process do the scripts call for my recorded voice file? Any help will be greatly appreciated. I managed to get the Real-Time Voice Cloning toolbox to work, but even so, the only thing I can do with it is run a quick demo, not fully train a model.