-
my attention is empty after 10k steps, which shouldn't be normal.
I'm using LJSpeech dataset.
This is the second time I preprocessed everything and trained.
![image](https://user-images.githubuse…
-
Hi, does anyone have the exception throwed, like the one described in the following stack:
Constructing model: WaveNet
Initializing Wavenet model. Dimensions (? = dynamic shape):
Train mode:…
-
1.is there any limit on data when preprocessing ?
I use the dataset with 15438 when in preprocessing it only produces 12988
2. what can be changed on h.params to solve OOM on wavenet training
-
Is the nancy corpus pre-trained model available anywhere for use? I think the one provided in the README is LJ Speech trained model
-
Hello,everyone.I synthesis the model on CPU for 6 minutes. How can I speed up and do real time synthesis on cpu.thank you
-
In the ReadMe, it's mentioned
> Convert the data generated at the last step which has .f32 extension to what could be loaded with numpy. I merge it to the Tacotron feeder here and here with the fo…
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
First, Thanks for the excellent work by CorentinJ!
I noticed that the speaker encoder used in this work is ge2e, performance of which is far fall behind the SOTA. So I replaced the ge2e encoder with …
-
I've trained a model based on the LJSpeech dataset and found the results quite satisfactory after 25000 steps in ForwardTacotron. Now, I'm currently preparing several other datasets where new models w…
-
Can someone share a voice sample he created with this repository based on a given and/or a custom set of voice files