-
I trained a speech model and the vocoder using the new version (espnet==0.10.6 and parallel_wavegan==0.5.4) and the synthesized voices have low quality. But when I use the old espnet (espnet==0.9.6 an…
-
I'm interested in using TTS for voice synthesis for reading text.
Maybe I can't find my way around here, but the readme gives a clear and concise way to get the tool for using this text-to-speech l…
-
Hi @kan-bayashi
A general question regarding the LibriTTS vocoders -- I am using them (melgan/hifigan/parallel_wavegan) to decode the spectrograms generated from LibriTTS Tacotron2/Transformer-TTS…
-
Hello again :)
Are lab mono files required to do the training or that step can be skipped using this script https://gist.github.com/kan-bayashi/eceafcd35a2351f5f6bf89a1ccb956e9 ?
-
Hi, I tried synthesising the wav file using 2 ways
Decoding using text2speech -> feat_gen -> vocoder produces different results than text2speech -> wav
```
text2speech = Text2Speech('your mode…
-
首先,感谢您在声码器上做的努力
我想问一下,推理速度上有与原始LPCNet做过对比吗
-
Updated unicorn, and espnet-tts is failing to start (keeps restarting the container, and does not work). It appears that the images were improperly built due to some dependencies on things in google d…
-
I am trying the finetuning part in this single speaker TTS training: https://github.com/kan-bayashi/ParallelWaveGAN/blob/master/egs/README.md
I am a first-time user. Would you help a little on thi…
-
DeprecationWarning: `np.complex` is a deprecated alias for the builtin `complex`. To silence this warning, use `complex` by itself. Doing this will not modify any behavior and is safe. If you specific…
-
in inference, the shape of c and x matches:
https://github.com/kan-bayashi/ParallelWaveGAN/blob/6d4411b65f9487de5ec49dabf029dc107f23192d/parallel_wavegan/models/style_melgan.py#L238
but in forward…