Open trueProgrammer opened 4 years ago
it should be fine to train multiple languages simultaneously and it's better to use the dictionary of your target language. make sure the model has the correct set of token embeddings and that the dataloader is using the right token ids for each language.
the issue you're observing might be related to wrong inputs.
First of all, thank you for releasing the code and making this fantastic paper!
I have read the instructions and all the issues and after that, I started to train the model. I used an Ljs dataset and a single male Russian speaker with about 40h of good clean speech. My steps were:
Even I got promising graphics the result of the audio is awful because it sounds like another language and you can't understand any word at all. I have a few questions:
Thank you in advance!