Closed pavelxx1 closed 3 years ago
If the feature vectors are extracted correctly, that's no problem. But I assumed every audio clip is around 5 s long.
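The 5 s assumption can be checked on a dataset before training. Below is a minimal sketch using only Python's standard `wave` module, assuming uncompressed PCM WAV files. The helper names `wav_duration_seconds` and `find_outliers` and the 1 s tolerance are illustrative assumptions, not part of the repo.

```python
import wave
from pathlib import Path


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wf:
        # duration = number of frames / frames per second
        return wf.getnframes() / wf.getframerate()


def find_outliers(wav_dir, target=5.0, tolerance=1.0):
    """Return (path, duration) pairs whose duration differs from
    `target` by more than `tolerance` seconds.

    The 5 s target comes from the reply above; the 1 s tolerance
    is an arbitrary assumption for illustration.
    """
    outliers = []
    for path in sorted(Path(wav_dir).glob("*.wav")):
        duration = wav_duration_seconds(path)
        if abs(duration - target) > tolerance:
            outliers.append((path, duration))
    return outliers
```

Calling `find_outliers("path/to/wavs")` (a hypothetical directory) lists the files that are far from 5 s, which could then be trimmed or split before feature extraction.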
Thanks. 1) How many WAV files does the dataset need for a good training result? 2) If I want to convert speakerA to speakerB, can I use a small dataset for speakerA, i.e. 5-10 WAV files? Thanks.
Hi, first, thanks for this great repo! I have a question.