An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Apache License 2.0
2.73k
stars
288
forks
source link
On the problem of missing three phonemes "ir", "E" and "En" #133
Hello, developers, I am practicing a diffsinger sound library with OpEncpop data set. As shown in the figure, the phonemes of "en", "e" and "ir" are missing during preprocessing.
After being instructed by others, I found that the phoneme "En" is not the same as the phoneme "en" and the phoneme "E" is not the same as the phoneme "e".Moreover, the factor "ir" is not a correct pronunciation of Chinese Pinyin, which is more like an English pronunciation.I hope developers can follow up and fix this bug in time.
The current situation is that four phonemes are missing, and the preprocessing step is reported as an error, and the train.lengths file is not generated, which leads to the failure to start the later training.We are using variance automatic pitch prediction model.
Hello, developers, I am practicing a diffsinger sound library with OpEncpop data set. As shown in the figure, the phonemes of "en", "e" and "ir" are missing during preprocessing.
After being instructed by others, I found that the phoneme "En" is not the same as the phoneme "en" and the phoneme "E" is not the same as the phoneme "e".Moreover, the factor "ir" is not a correct pronunciation of Chinese Pinyin, which is more like an English pronunciation.I hope developers can follow up and fix this bug in time. The current situation is that four phonemes are missing, and the preprocessing step is reported as an error, and the train.lengths file is not generated, which leads to the failure to start the later training.We are using variance automatic pitch prediction model.