-
Hi,
I know this library is primarily for text -> voice but do you know if it would be possible to modify it to accept a speaker embedding and perform zero-shot voice cloning?
Thanks!
-
I do everything according to the instructions in train.sh , downloaded an archive with audio and marks.txt , the folder that was required and nemo, I run - and here is such a series of errors
```…
Vubni updated
5 months ago
-
In Zero Shot mode,if the Prompt text or Target text does not end with punctuation (eg '.' or '?' or '!' ), the following error message will happen:
Traceback (most recent call last):
File "./tes…
-
@ZhangXInFD Are you simply replaced the 'NAR' of USLM with trained SoundStorm speech tokenizer for zero shot TTS task ?
Although quality of SoundStorm is much better Have you notice any speed advanta…
-
https://coqui.ai/blog/tts/yourtts-zero-shot-text-synthesis-low-resource-languages
最近新提出的YourTTS不知道有没有人关注,请问有人做过比较吗?
-
非常感谢有这样的杰出opensource,极大的降低了个性化tts的难度
关于inference_webui.py的问题,当我用它来zero shot,也就是在共有模型基础上,只提供8秒的参考音频进行tts。
1. get_tts_wav中对于待推理的文版进行了处理,phones2,bert2,norm_text2=get_phones_and_bert(text, text_language…
-
看运行只有CosyVoice-300M基础模型,下面client使用能分别使用sft|zero_shot|cross_lingual|instruct 4中模型?我看前面文档有写CosyVoice-300M-sft是tts,CosyVoice-300M直接是克隆,CosyVoice-300M-instruct 是加入语态控制
docker run -d --runtime=nvidia -p 5…
-
run inference code as shown below:
```
ref_path="female01.wav"
cosyvoice = CosyVoice("pretrained_models/CosyVoice-300M")
# zero_shot usage
prompt_speech_16k = load_wav(ref_path, 16000)
output…
-
I have just started playing this Tortoise TTS.
I have been mainly using the following Hugging Face Space to do some initial testing and experiment.
https://huggingface.co/spaces/mdnestor/tortois…
-
I have tried to install nemo_toolkit==2.0.0.rc0
but it shows error:
RuntimeError: causal_conv1d is only supported on CUDA 11.6 and above. Note: make sure nvcc has a supported version by running n…