Closed scripples closed 4 years ago
Hey, just prepare your dataset in the LJSpeech format:
|- dataset_folder/
| |- metadata.csv
| |- wav/
| |- file1.wav
| |- ...
If the language differs from English, make sure you set the correct language in the hparams.py file:
language = 'fr'
tts_cleaner_name = 'basic_cleaners'
Then just follow the steps from the README with preprocessing the folder, everything should be done automatically including splitting of the dataset into train/val etc.
I updated the README to be clearer on this. Best of luck!
Thanks for your help! I'm looking forward to giving it a try.
Where find list is supported languages?
@paklau99988 you can find the list of languages from here
Hi, I'm trying to prep my own dataset to train on the ForwardTacotron model--could you give any insight as to what train_tacotron.py or train_forward.py is expecting in terms of training data organization? Like, the old NVIDIA TT2 repo expects two text files formatted in a certain way and a path to the WAV files in the arguments. Is there something similar for this repo?