-
Very nice repo! Thank you, authors, for your contribution.
Here is my situation: I have been trying to use about 20,000 hours of open-source speech data with this repo (version 1.2.7) and sta…
-
Hello, I trained MB MelGAN v3 for 700k steps on a small, 570-utterance single-speaker dataset. The output sounds very robotic and the loss curves do not look good. What am I doing wrong? I also get bad results when res…
-
@kan-bayashi, FYI.
https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9455356
-
Hello.
Does this repository work with TF>2? I'm having a hard time with virtual environments and these older versions on Linux.
Thanks.
-
Could you provide the code for wav-to-wav conversion, so I can do it the indirect way?
-
My dataset takes up 184 GB in the dump folder and has approximately 150 speakers with a mix of songs and speech. I used the default recipe settings. After the model finished training at 400k steps, the g…
-
## Please report TTS text frontend bugs here, for example: text normalization, polyphones, tone sandhi, etc.
**We encourage developers to solve these problems.**
1. polyphone: 能说多长(zhang3 ❎)的…
-
Even though the preprocessors run sequentially, when the docker container grabs GPU RAM, it hangs onto it forever. Our current preprocessors outstrip the 6GB GPU on unicorn, and this will be an increa…
-
First of all, thank you for creating those videos and articles explaining the details.
They are very useful for reference.
However, after reading and searching for a while, I still cannot confirm some det…
-
The original issue had grown too long, so I closed it and opened a new one.
First, I found a bug in revise_text.py:
```
def process(files, path):
    text_dict = {}
    with open("./text.txt", 'r', encoding='utf-8') as text_file:
        for line in text_fi…