-
不知道是不是我的操作有问题,下载模型后,跑了openvoice_app.py得到的声音感觉不像原来的。
原声录音很清晰,没有背景音,单人,用vits fast fine-tuning训练的跑了40轮左右就能得到还原度很高的克隆语音了。
但是用本项目的就不行,似乎是这个repo的作者给的模型不通用。
不知道大家有没有碰到这样的问题?
-
Hi guys!
I have run everything with Russian dataset to get alignments. Then I used these alignments to train a speech synthesis model (Fast Speech 2) on phoneme level. But then I can't figure out …
diff7 updated
3 years ago
-
Hi team
First of all, great job with MetaVoice. Everything in the repository works as expected.
I went through the code to understand the 4 stage inference and correlate it with the documentatio…
-
Dataloader name: `cmu_wilderness_multilingual_speech_dataset/cmu_wilderness_multilingual_speech_dataset.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?cmu_wilderness_multili…
-
Hi!
I've tried training the model using my own samples but it gave me assertion error in preprocess.py, so Ive tried changing the n_frames value from 128 to 32 and it seemed to have worked. Also I'v…
-
## Deep Learning for Text-to-Speech Synthesis, using the Merlin toolkit
This [tutorial](http://www.interspeech2017.org/program/tutorial-descriptions/) will combine the theory and practical applicat…
-
when I run audio-to-txt using api, it always run on my CPU and my gpu is free, I want to set it run on my gpu to improve running speed.
-
Hey,
First of all, really amazing work that you are doing, I came here from some of your youtube videos, very interesting stuff with RPM and Unity.
Following this tutorial I did an example of me…
-
### Describe the bug
When running text-to-speech on an english model, when tts tries to write the .wav file, it runs out of memory. I'm running on cpu only. My machine has ~14GB available RAM
I …
-
We have trained StyleTTS2 model for Hindi language. Initially we trained PL-bert for Hindi considering Espeak phonemizer and Indicbert tokenizer. Then we utilized that newly trained Hindi PLbert by re…