jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
https://jaywalnut310.github.io/vits-demo/index.html
MIT License
6.73k stars 1.24k forks source link

What tools do you use to create/split file lists for new dataset? #36

Open skol101 opened 2 years ago

skol101 commented 2 years ago

Please advise as per subject.

nikich340 commented 2 years ago

You should have some file prepared where wavs paths are assigned with texts. Then run preprocess script as written in step 4 of "Pre-requisites" to convert text to IPA phonemes. Note that you must have eSpeak installed correctly. Resulting .cleaned file you just split manually into train (95%) and valid (5%) filelists, set paths to them in your config.json