Only Static Noises? - Githubissues

SayaSS / vits-finetuning

Fine-Tuning your VITS model using a pre-trained model

MIT License

546 stars 86 forks source link

I'm not sure about your issue. But I think you should always use the G model when you try generating. I have a similar question so I post it here:

I have trained mine 600 epochs without a pre-trained model. Now I get something that sounds like human voices, but with some severe metallic noise. There are lots of warnings saying:

/content/vits/utils.py:138: WavFileWarning: Chunk (non-data) not understood, skipping it.
  sampling_rate, data = read(full_path)

Is this normal? Or should I recollect the dataset and start anew?

SayaSS / vits-finetuning

Only Static Noises? #31