-
I trained a smaller model for the TTS task only, as the TTS recipe instructed. However, it cannot generate correct TTS audio output (the generated audio content does not match the text). Have you e…
-
May I ask whether this implementation of the model has been tested on the mel spectrum? I used a
Transformer model with only convolutional positional encoding added at the beginning to get discontinuou…
-
Hello,
Thanks for all the effort that went into creating this repo. When I launch training, it runs for a few steps and then I see no progress at all. It's just stuck without any progress for a long time. It stil…
-
The training example given seems to be missing the mask vector. In the paper, the input to the model was the audio, the mask, and the phoneme sequence (which was aligned to the audio in the previous impleme…
-
I just installed TensorFlowTTS version 1.8 with `pip install TensorFlowTTS`, but when I open the notebook\OneShotVoiceClone_Inference.ipynb, it shows an import error:
![image](https://user-image…
wwdok updated
2 years ago
-
SeamlessM4Tv2, released today, seems to have all this plus translation with streaming support. Will it be better than Whisper and Coqui?
jkfnc updated
3 months ago
-
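I would like to ask you a few questions.
1. What is the difference between the diff-vits project and ns2 tts-v2?
From a rough look now and from what I saw earlier, it seems the main model was changed to VITS while the NaturalSpeech architecture was kept?
2. I tested the tts-v2 model with a training dataset of 1500+ voices and 600+ hours, and on out-of-set test data most results are still not very similar to the target.
Is it true, as tested in the paper, that a much larger dataset is needed to generalize to out-of-set data? Do you think a large…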
-
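Is it possible to take one person's clean speech and a text as input, and output that person's speech reading the text, without any training in between? I would like to use one model to solve all VC.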
-
Hi! Thank you very much for your work and this amazing repo.
I am trying to train the v4 branch, and something is going very wrong here.
After about 3 hours of training nothing has changed; I get noise at every step. I us…
lpscr updated
2 months ago
-
Hello p0p4k,
I'm reaching out to you again with a question.
Thanks to your great help, I've successfully trained the Korean pflow model and run inference with it. During the inference process, I observed a f…