-
What configuration parameters should I modify in order to train with 16kHz audio?
I intend to work with:
sampling_rate: 16000
fft_size: 512
hop_size: 128
I believe I should also change upsamp…
-
Hi, thank you so much for this fantastic repo and I apologize if this is an ignorant question, but on my GTX 1070 Ti the pre-trained wavenet.mol.v1 models don't decode mels faster than real-time. I pr…
-
Hi,
I've come across the following error.
```
> ---------------------------------------------------------------------------
> RuntimeError Traceback (most recent cal…
-
-
I tried to test Chinese TTS with scripts, but I didn't get the expected results. What is the format of Chinese Pinyin input?
-
I observed a interesting behaviour after 138K iters where discriminator dominated the training and generator exploded in both train and validation losses. Do you have any idea why and how to prevent …
-
Hello, I just found "checkpoint-400000steps.pkl" in your google drive,and I want to synthsis Chinese voice with your pretrained csmsc model, but now I don't know how to use it.
Can you tell me if…
-
I'm working on extending TTS to the languages and speakers available in /egs/m_ailabs but synth_wav.sh will only output a default speaker.
I have followed this procedure: executed run.sh through s…
-
I am using Tacotron2 trained on a custom dataset on V3 Config, along with a custom trained ParallelWaveGAN FROM @kan-bayashi implementation. Everything works well but i have two concerns.
Q1 - Is t…
-
I compared the following two models:
- (Red) The model which trains the discriminator from 200k iters
- (Blue) The model which trains the discriminator from the first iter
Here is the training curv…