openvpi / DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Apache License 2.0
2.73k stars 288 forks source link

On the problem of missing three phonemes "ir", "E" and "En" #133

Closed Wangs-official closed 1 year ago

Wangs-official commented 1 year ago

Hello, developers, I am practicing a diffsinger sound library with OpEncpop data set. As shown in the figure, the phonemes of "en", "e" and "ir" are missing during preprocessing. imageimageimage

After being instructed by others, I found that the phoneme "En" is not the same as the phoneme "en" and the phoneme "E" is not the same as the phoneme "e".Moreover, the factor "ir" is not a correct pronunciation of Chinese Pinyin, which is more like an English pronunciation.I hope developers can follow up and fix this bug in time. The current situation is that four phonemes are missing, and the preprocessing step is reported as an error, and the train.lengths file is not generated, which leads to the failure to start the later training.We are using variance automatic pitch prediction model.

Wangs-official commented 1 year ago

By the way, there is also an i0 phoneme.