Open alokprasad opened 5 years ago
extrally. T2 may need a large dataset, so training them on a large dataset such as LJSpeech is a good idea.
Should I retrain Tacotron2+LPCNet, and train LPCNet to use GTA mode?
@lyz04551 I do not recommend using GTA mode unless you have read the LPCNet code closely. You need to figure out how LPCNet's feature extraction handles the start and end of the audio, along with any other transformations it introduces.
Thank you for your reply. Have you run into any problems with the volume of the synthesized audio? I am using alokprasad's LPCTron code with essentially the same parameters and the training method you recommended, retrained on a Chinese dataset. Do you see a similar volume problem?
@lyz04551 You could normalize the audio before feature extraction.
Could you explain in more detail? Do you mean normalizing the magnitude?
@lyz04551 Yes, rescale the audio volume. You can refer to the Tacotron2 preprocessing to see how.
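For anyone following along, here is a minimal sketch of what rescaling the volume before feature extraction could look like. It is plain peak normalization with NumPy, not code taken from this thread or from LPCTron. The function names and the `rescaling_max` value of 0.999 are assumptions for illustration, so check the preprocessing code you actually use for its own setting.

```python
import numpy as np

def rescale_audio(wav, rescaling_max=0.999):
    """Peak-normalize a float waveform so its largest sample equals rescaling_max.

    rescaling_max=0.999 is an assumed default for this sketch.
    """
    wav = np.asarray(wav, dtype=np.float32)
    peak = np.abs(wav).max() if wav.size else 0.0
    if peak == 0:
        # Silent or empty input: nothing to rescale, avoid dividing by zero.
        return wav
    return wav / peak * rescaling_max

def float_to_pcm16(wav):
    """Convert a float waveform in [-1, 1] to 16-bit PCM samples."""
    return (np.clip(wav, -1.0, 1.0) * 32767).astype(np.int16)

if __name__ == "__main__":
    # A quiet 440 Hz tone at 16 kHz stands in for a real recording.
    t = np.arange(16000) / 16000.0
    quiet = 0.1 * np.sin(2 * np.pi * 440 * t)
    loud = rescale_audio(quiet)
    print("peak before:", float(np.abs(quiet).max()))
    print("peak after:", float(np.abs(loud).max()))
```

Applying the same rescaling to every file before running the feature extraction keeps the training audio at a consistent level, which is one thing to rule out when the synthesized volume seems off.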
test.zip This is audio I synthesized with Tacotron+LPCNet. The sound quality never seems particularly good to me. Can you hear any problems?
Hello @lyz04551, did you fix the issue? I'm facing the same one.
Here are examples synthesized from the taco2 output and from the LPCNet-extracted features. The archive also contains my hparams: audios.zip
Which params am I missing? It is the same dataset with the same extracted features. Any ideas?
Audio synthesized from the real features sounds good.
Audio from the taco2 features sounds as if a filter is being applied, but I can't figure out where.
Thanks @MlWoo and @alokprasad for the scripts, saved me a lot of time.
1. Do we need to train LPCNet with the LJSpeech dataset or with 16k-LP7?
2. Do we need to train both LPCNet and Tacotron2 with the same dataset?
3. Do we need to run Tacotron-2/preprocess.py, or can we just use ./header_removal.sh --> pcm files and ./feature_extract.sh --> f32 files? train.py takes wav and npy files, so how will it take pcm and f32 files?
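On the pcm/f32 part of the question, here is a rough sketch of how such raw files could be loaded with NumPy and saved as npy. It assumes the pcm files are headerless 16-bit samples and the f32 files are flat float32 arrays with a fixed number of features per frame. The helper names and the default of 20 features per frame are assumptions for illustration, so verify the frame size against your feature extraction before relying on it.

```python
import numpy as np

def load_pcm16(path):
    """Read a headerless 16-bit PCM file into an int16 array."""
    return np.fromfile(path, dtype=np.int16)

def load_f32_features(path, n_features=20):
    """Read a flat float32 feature dump and reshape it to (frames, n_features).

    n_features=20 is an assumed frame size for this sketch.
    """
    flat = np.fromfile(path, dtype=np.float32)
    if flat.size % n_features != 0:
        raise ValueError(
            f"{path}: {flat.size} values do not divide into frames of {n_features}"
        )
    return flat.reshape(-1, n_features)

def f32_to_npy(f32_path, npy_path, n_features=20):
    """Convert a .f32 feature dump into a .npy file that np.load can read."""
    features = load_f32_features(f32_path, n_features)
    np.save(npy_path, features)
    return features.shape
```

The size check is there because a wrong frame size would otherwise fail silently or scramble the frames after the reshape.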