-
I think this "DiffSinger" model is based on Chinese.
Please give me advice on how to train them in another language.
Thank you for share!
-
- PaddleSpeech 活跃开发者 @jerryuhoo 基于非官方开源实现 https://github.com/So-Fann/VISinger 为 espent 贡献了 VISinger 模型代码,预处理部分不是自己写的,可以等 DiffSinger 串通整体流程后再开始 VISinger 的**模型**复现
- VISinger2 官方实现已开源:https://github.co…
-
The paper mentions using CoMoSpeech for the SVS task and briefly describes feature extraction, but I can't find that feature extraction or summing with the phoneme features in the code. Is that planne…
-
First of all, thanks for this great paper and official open-source code.
I want to check demo page which contains DiffSinger and baselines. But I can't access to demo page in the paper. How can I ge…
-
Hello! Great job! I would like to know a few things. Interested in SVS (POPCS)
1) Can you tell me about inference? What files are used for inferencing? What's the recipe? How did you manage to repeat…
-
I've had positive experiences using ColorSplitter for .wav files and creating datasets for DiffSinger. It excels in recognizing vocal timbres, especially in audio files ranging from 5 to 15 seconds. W…
-
Hello,
I'm interested in running command line inference using the .ckpt's of the model I trained, but after reading the instructions under `Inference` in `docs/GettingStarted.md` and the outputs o…
-
# 🌟 New model addition
## Model description
FastSpeech2 is a TTS model that outputs mel-spectrograms given some input text. From the [paper](https://arxiv.org/abs/2006.04558) abstract:
> Non-…
-
I find that there are some bad cases in F0 prediction. I recommend people to increase 'predictor_layers' or decrease 'predictor_dropout' to enhance the ability of pitch predictor for the part of MIDI …
-
### Acknowledgement
- [X] I have read Getting-Started and FAQ
### 🐛 Describe the bug
I have trained a DiffSinger DB with the new multi-dictionary support, but I cannot for the life of me reproduce …