Closed ming024 closed 4 years ago
multi-lingual-multi-speaker.zip
This is a result from my model, trained on part of the M-AILABS dataset (whose quality is not good) together with other datasets. Let's say this audio was generated by a model trained on a multi-lingual, multi-speaker dataset. English is not the main language, so if you train the model on English only, the result should be better. The model is modified a bit to work with multiple languages. I also tested the public code here for multi-speaker training, and it works fine.
@ming024 if you don't have any questions, please close the issue :D
I think the audio quality is very good. How many speakers are there in your datasets?
Sorry, that is private information :D.
I have some too, from an MFA-aligned FS2 model. These are samples from 11 speakers. The data totals 18 hours of audio distributed over 136 speakers (the distribution is not uniform: some speakers have as little as 20 seconds, while others have 50 minutes). 11samples-50ks.zip
@dathudeptrai @ZDisket Thanks a lot, I will close this issue #250
It is really an astonishingly large project.
I have seen that there is multi-speaker support in the preprocessing scripts and model configs. It would be great if anyone could share multi-speaker audio samples generated with the non-autoregressive models such as FastSpeech and FastSpeech2.