Epokhe / voice-mimicker

A neural net for mimicking someone's speech
10 stars 1 forks source link

training #1

Open hurnhu opened 6 years ago

hurnhu commented 6 years ago

what is the training process and requirements?

Epokhe commented 4 years ago

Wow, I just saw this and it may be a bit late, sorry 😅

Training process is explained a bit in the report. Summary:

Encoder part is trained more than decoder part-I had a hyperparameter to tune the ratio-because training encoder is harder than training the decoder. Training took 3-4 days with a GTX 1070.

image

In hindsight, I would improve my result with these two steps without changing the model architecture:

There are recent very successful projects, check them out too: https://github.com/CorentinJ/Real-Time-Voice-Cloning