Open Cardroid opened 2 years ago
DiffSinger takes phonemes, pitch, and duration as input. You need a grapheme-to-phoneme (G2P) tool, such as g2p_en for English or pypinyin for Chinese, or an equivalent for your language.
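As a rough illustration of what "phoneme, pitch, and duration" input can look like, here is a minimal sketch that uses only the standard library. The toy lexicon, the `Note` class, and the even split of each note's duration across its phonemes are all assumptions made for this example, not DiffSinger's actual data format. In practice you would replace `g2p` with a real G2P tool for your language and take phoneme durations from your dataset's labels or an alignment step.

```python
from dataclasses import dataclass

# Hypothetical toy lexicon, for illustration only. A real setup would use a
# full pronunciation dictionary or a G2P library for the target language.
TOY_LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


@dataclass
class Note:
    """One sung word (hypothetical structure, not DiffSinger's own format)."""
    word: str
    midi_pitch: int     # MIDI note number, e.g. 60 = middle C
    duration_s: float   # note length in seconds


def g2p(word):
    """Look up a word's phonemes; fail loudly on out-of-vocabulary words."""
    try:
        return TOY_LEXICON[word.lower()]
    except KeyError:
        raise KeyError(f"no pronunciation for {word!r}") from None


def notes_to_inputs(notes):
    """Expand word-level notes into (phoneme, pitch, duration) triples.

    Assumption: each note's duration is split evenly across its phonemes.
    """
    triples = []
    for note in notes:
        phonemes = g2p(note.word)
        per_phoneme = note.duration_s / len(phonemes)
        for ph in phonemes:
            triples.append((ph, note.midi_pitch, per_phoneme))
    return triples


inputs = notes_to_inputs([Note("hello", 60, 0.8), Note("world", 62, 0.4)])
for phoneme, pitch, duration in inputs:
    print(phoneme, pitch, round(duration, 3))
```

For a new language, the main work is replacing the lexicon lookup with a phoneme set and G2P step that cover your data.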
Thank you for your answer. I have another question. How do you fine-tune the vocoder for the new dataset?
Can you please explain how to train DiffSinger in a different language? Does the vocoder need to be trained from scratch, and if so, how?
Hi, can you please describe what needs to be done to train DiffSinger in a different language?
I think this DiffSinger model is based on Chinese. Please give me advice on how to train it in another language. Thank you for sharing!