-
Hi @KdaiP nice work, just like to know is this architecture is intended to support zero-shot TTS or normal multi-speaker kind of TTS,
-
run inference code as shown below:
```
ref_path="female01.wav"
cosyvoice = CosyVoice("pretrained_models/CosyVoice-300M")
# zero_shot usage
prompt_speech_16k = load_wav(ref_path, 16000)
output…
-
Hi, there are some examples of tts(zero shot) on libritts?
-
Hello, thank for sharing your awesome work
Can you please tell me, if there's an ability to to TTS with your model with speech conditioning? Zero-shot tts the way tortoise tts or xtts does that
As I…
-
Thank you for your open source work, but I seem to have not found the complete implementation of zero-shot TTS.
1. The default dataset for radtts in the tutorials does not include the file coqui_re…
-
Hi, thank you for open-sourcing your excellent work. ❤️
I would like to compare with VoiceCraft as a baseline for my research. I have observed that you have released three TTS enhanced models. I am…
zjlww updated
4 weeks ago
-
Can you explain the difference between the FAcodec pretrained model "FACodecEncoderV2" vs "FACodecEncoder" ?
Why using "FACodecEncoderV2" to do zero-shot TTS?
Are these two difference from the t…
-
Hi,
I know this library is primarily for text -> voice but do you know if it would be possible to modify it to accept a speaker embedding and perform zero-shot voice cloning?
Thanks!
-
I do everything according to the instructions in train.sh , downloaded an archive with audio and marks.txt , the folder that was required and nemo, I run - and here is such a series of errors
```…
Vubni updated
5 months ago
-
@ZhangXInFD Are you simply replaced the 'NAR' of USLM with trained SoundStorm speech tokenizer for zero shot TTS task ?
Although quality of SoundStorm is much better Have you notice any speed advanta…