-
Hi, I trained a multi-speaker TTS model with around 100 users, using the "espnet/egs/Libritts" template with a non-English language. Before that, I successfully trained and tested a non-English single-speaker model. During m…
-
**Describe the bug**
In the docs screenshot there's a dropdown to select the desired model:
https://github.com/coqui-ai/TTS/raw/main/images/demo_server.gif
but in the local tts-server there's no dropdo…
-
Hi @Rongjiehuang - I was wondering if you've encountered something like this in your experiment with multi-band WaveRNN? I find that when I disable the `.astype(np.int64)` call in [here](https://gith…
-
I got this error when running the example in Colab:
espnet==0.9.7
parallel_wavegan==0.4.8
![image](https://user-images.githubusercontent.com/31467068/122771365-52967a00-d2e1-11eb-830a-c17658e4e3f6.png…
-
So I'm not 100% sure this is already implemented in the version of Tacotron that's included with this build, but Tacotron has support for GSTs (Global Style Tokens) now. I think it's already using these be…
-
Please, I would like to know if there is a pre-trained model for Spanish or German. I am creating an implementation, but I need these two languages to make it more accessible in my country. In o…
-
Hi, I have noticed that the loudness of the synthesized waveform varies for PWG. Is it possible to make sure that the synthesized waveform always has the same loudness?
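
To make the request concrete, here is a minimal sketch of the kind of post-processing I have in mind: scaling each synthesized waveform to a fixed RMS level. This is not part of PWG or ESPnet; `normalize_rms` and the `-20 dBFS` target are hypothetical names and values I chose for illustration, and the input is assumed to be a float waveform in the range [-1, 1].

```python
import numpy as np


def normalize_rms(wav: np.ndarray, target_dbfs: float = -20.0, eps: float = 1e-9) -> np.ndarray:
    """Scale a float waveform so its RMS level matches target_dbfs.

    Hypothetical helper for illustration; not an API of PWG or ESPnet.
    """
    wav = wav.astype(np.float64)
    rms = np.sqrt(np.mean(wav ** 2))
    if rms < eps:
        # Silent input: nothing to scale, return it unchanged.
        return wav
    target_rms = 10.0 ** (target_dbfs / 20.0)
    scaled = wav * (target_rms / rms)
    # If the gain would clip, back off so the peak stays within [-1, 1].
    peak = np.max(np.abs(scaled))
    if peak > 1.0:
        scaled = scaled / peak
    return scaled
```

RMS is only a rough proxy for perceived loudness, so this may not be enough if the variation is perceptual rather than in signal level.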
-
Hi. I want to implement a real-time TTS demo with my model.
(link : https://colab.research.google.com/github/espnet/notebook/blob/master/espnet2_tts_realtime_demo.ipynb#scrollTo=J-Bvca5mE7bT)
But in…
-
Hi @kan-bayashi
I have noticed that the RTF varies even when we use manual seeds for both taco2 and pwg. I am wondering where this randomness is coming from, which makes the calculation of tensors …
-
In the Synthesis section of your [colab](https://colab.research.google.com/github/espnet/notebook/blob/master/espnet2_tts_realtime_demo.ipynb#scrollTo=o87zK1NLobne) when I try with my own wav file it …