OlaWod / FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
MIT License
591 stars 109 forks source link

target pitch issue after training (not appearing if using the pretrained checkpoint) #85

Open fervillamar opened 10 months ago

fervillamar commented 10 months ago

Hi, I'm running trainings with and w/o using the pretained checkpoint (VCTK) as initial state. However, in both cases the target pitch is affected by the input pitch (e.g. from female to male conversion, the target pitch will be higher, like somewhere between the source and target speakers range). This was not happening with the pre-trained model itself. Would you mind to share some comments on things that were considered to trained the pre-trrained model that may be missing in the paper or here in this repository?, did you experience this in your experimentation?, thanks in advance.

NLPV2011 commented 9 months ago

I think your speakers should be recorded with many different pitches, not just 1 pitch, I think this can be fixed