-
What I have done
1. conda create --name espnet python=3.10
2. conda activate espnet
3. git clone git clone https://github.com/espnet/espnet.git
4. pip install -q espnet==202308 pypinyin==0.44.0 pa…
-
Hi, first of all, big thanks to all the people who helped develop the MelGAN model! The inference speed is super fast!
I modified the configs from the `VCTK` recipe to train a [parallel_wavegan.v1…
-
i have a people 50 sentences voice, could i train a WaveGan model with 50 sentences, Let the sound adapt the people?
wac81 updated
3 years ago
-
Can create separate branch for TTS implementation, that's the ultimate goal for every neural vocoder. I will try to use this implementation with nvidia's Tacotron2, as preprocessing for both networks …
-
## Description
(A clear and concise description of what the feature is.)
A api that can let user uses QPPWG not in a very user friendly sense.
This is just the start.I also do not have enough exper…
-
Hello, I trained MB Melgan v3 700k steps on small 570 utterance single-speaker dataset and output very robotic and loss curves look not so good. What am I doing wrong? I also have bad results when res…
gafsd updated
3 years ago
-
@kan-bayashi , FYI.
https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9455356
-
Can you provide me with the code to convert wav to wav to do it the indirect way?
-
My dataset is 184G in the dump folder, and I have approximately 150 speakers with mixed songs and speech, and I used the default recipe settings. After the model finished training at 400k steps, the g…
ghost updated
3 years ago
-
## Please report TTS text frontend bugs here, for examples: text normalization, polyphone and tone sandhi, etc.
**We encourage developers to solve these problems.**
1. polyphone: 能说多长(zhang3 ❎)的…