Open Lukelluke opened 3 years ago
Hi, I have not trained the one-hot version, but I have some ideas to share. The only difference between the one-hot version and the speaker-encoder version is whether the speaker embedding can be trained during the AutoVC training process. Training in one-hot mode might look like this:
In fact, the "one-hot pattern" the author has in mind may simply be the most straightforward way to train the model for a multi-speaker problem. It may work better than the speaker-encoder version because its embedding can be updated by gradients, while the speaker encoder's embedding cannot.
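To make the idea concrete, here is a minimal PyTorch sketch of a speaker embedding that is trained together with the model. It is only an illustration under stated assumptions, not the author's code: `OneHotSpeakerEmbedding`, `toy_generator`, and all sizes are hypothetical names and values, and the `nn.Linear` layer is just a stand-in for the real AutoVC generator. It also shows that an embedding lookup gives the same result as multiplying a one-hot vector by a weight matrix.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class OneHotSpeakerEmbedding(nn.Module):
    """Hypothetical helper: a trainable lookup table indexed by speaker ID.

    Looking up row i is equivalent to multiplying a one-hot vector for
    speaker i by the table's weight matrix.
    """

    def __init__(self, num_speakers: int, dim_emb: int):
        super().__init__()
        self.table = nn.Embedding(num_speakers, dim_emb)

    def forward(self, speaker_ids: torch.Tensor) -> torch.Tensor:
        return self.table(speaker_ids)  # (batch, dim_emb)


torch.manual_seed(0)

# All sizes below are illustrative assumptions, not values from the repo.
num_speakers, dim_emb, n_mels, frames = 4, 8, 80, 16

spk = OneHotSpeakerEmbedding(num_speakers, dim_emb)
# Stand-in for the AutoVC generator (assumption): any module that
# consumes mel frames concatenated with the speaker embedding.
toy_generator = nn.Linear(n_mels + dim_emb, n_mels)

# The key point: the embedding table is passed to the optimizer too.
optimizer = torch.optim.Adam(
    list(spk.parameters()) + list(toy_generator.parameters()), lr=1e-3
)

mel = torch.randn(2, frames, n_mels)   # fake batch of mel-spectrograms
speaker_ids = torch.tensor([0, 2])     # integer speaker labels

emb = spk(speaker_ids)                 # (2, dim_emb)

# Same result through an explicit one-hot vector.
one_hot = F.one_hot(speaker_ids, num_speakers).float()
via_one_hot = (one_hot @ spk.table.weight).detach()

# Broadcast the embedding over time and run a reconstruction step.
emb_per_frame = emb.unsqueeze(1).expand(-1, frames, -1)
recon = toy_generator(torch.cat([mel, emb_per_frame], dim=-1))
loss = F.mse_loss(recon, mel)

optimizer.zero_grad()
loss.backward()
grad = spk.table.weight.grad           # gradient w.r.t. the embedding table
optimizer.step()
```

In this sketch only the rows for the speakers present in the batch receive a gradient. With a frozen pretrained speaker encoder, by contrast, the embedding is a fixed input and nothing flows back into it.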
Hi @auspicious3000,
I looked through all the files and issues but didn't find any description of how to train in one-hot mode, which you suggest we use if we don't need one-shot performance.
Could anyone who has successfully trained AutoVC in one-hot mode, rather than with embeddings from the pretrained speaker encoder, share how they did it?
Hope to get a useful reply from you all!
All the best, Luke Huang