-
D:\迅雷下载\MockingBird-main>python demo_toolbox.py
Arguments:
datasets_root: None
enc_models_dir: encoder\saved_models
syn_models_dir: synthesizer\saved_models
voc_models_dir:…
-
For support and discussions, please use our [Discourse forums](https://github.com/PaddlePaddle/DeepSpeech/discussions).
Calculating MFCCs...
Traceback (most recent call last):
File "aligner/comma…
-
**环境配置**
环境window系统安装的ubuntu20.4虚拟机
Linux 024b4fd43ccc 5.10.16.3-microsoft-standard-WSL2 #1 SMP Fri Apr 2 22:23:49 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux
nvcc
nvcc: NVIDIA (R) Cuda compiler dri…
-
尝试用tts_finetune的模式去做广东话克隆,声音是像的,可是就是电流声大,训练数据是来自于自己的麦克风录音,训练数据听起来很清晰的,可是finetune出来的结果就是大“震音/电流音”, 尝试用其他TTS生成的wav和差不多的量作为训练题材,克隆出来的效果很不错。请问效果不好是因为录音问题吗?
Finetune的步骤
1. 用MFA对齐
2. 使用了预训练模型fastspeech…
-
I'd like to inquire about the training results. I have combined datasets AISHELL3, aidata, and a Chinese dataset, totaling 600 hours of training. Although the three audio files are not 24000Hz, I have…
-
已支持的有 aidatatang(已验证200zh), Magic Data(已验证open SLR68)
需要更多请在这里提建议,并+1投票,将为大家补充支持
-
Hi author, I have found the notes as "the generated audio has interference at a specific frequency" in this repo. I have encountered with the straight line at a specific frequency when developing si…
-
原来的issue太长了,所以关了重新开了一个。
首先是发现了revise_text.py里面的一个bug
```
def process(files, path):
text_dict = {}
with open("./text.txt" ,'r', encoding='utf-8') as text_file:
for line in text_fi…
-
我这个数据集是不是还不对呀,看着eval\loss还是不对劲
![image](https://user-images.githubusercontent.com/20817575/170834328-1e3a3ab2-2eec-468d-97b5-3792b08a71d2.png)
-
Can I use my own voice? 可以用自己的声音吗