openvpi / DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Apache License 2.0
2.62k stars 275 forks source link

On the problem of missing three phonemes "ir", "E" and "En" #133

Closed Wangs-official closed 11 months ago

Wangs-official commented 11 months ago

Hello, developers, I am practicing a diffsinger sound library with OpEncpop data set. As shown in the figure, the phonemes of "en", "e" and "ir" are missing during preprocessing. imageimageimage

After being instructed by others, I found that the phoneme "En" is not the same as the phoneme "en" and the phoneme "E" is not the same as the phoneme "e".Moreover, the factor "ir" is not a correct pronunciation of Chinese Pinyin, which is more like an English pronunciation.I hope developers can follow up and fix this bug in time. The current situation is that four phonemes are missing, and the preprocessing step is reported as an error, and the train.lengths file is not generated, which leads to the failure to start the later training.We are using variance automatic pitch prediction model.

Wangs-official commented 11 months ago

By the way, there is also an i0 phoneme.