openvpi / MakeDiffSinger

Pipelines and tools to build your own DiffSinger dataset.
BSD 3-Clause "New" or "Revised" License

Automatic `ph_num` generation method for universal dictionaries #11

Closed by yqzhishen 10 months ago

yqzhishen commented 10 months ago

Motivation

The MakeDiffSinger repository currently supports automatic `ph_num` inference only for monosyllabic phoneme systems. In a polysyllabic system, one word can contain multiple vowels, and not every vowel is an onset phone, so the onset phones cannot be determined from the phone sequence alone.
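To illustrate the monosyllabic case, here is a minimal sketch, not the repository's actual implementation. It assumes that every vowel is an onset phone, that non-lexical marks such as `SP` and `AP` also open a group, and that `ph_num` lists the size of each group. The function name `infer_ph_num`, its parameters, and the sample phones are hypothetical.

```python
def infer_ph_num(ph_seq, vowels, non_lexical=("SP", "AP")):
    """Count phones per group, opening a new group at every onset phone.

    Assumption (monosyllabic system): every vowel is an onset phone,
    and non-lexical marks such as SP/AP also open a group.
    """
    onsets = set(vowels) | set(non_lexical)
    ph_num = []
    for i, ph in enumerate(ph_seq):
        if i == 0 or ph in onsets:
            # The first phone, or any onset phone, opens a new group.
            ph_num.append(1)
        else:
            # A consonant is attached to the group opened before it.
            ph_num[-1] += 1
    return ph_num


# Hypothetical phone sequence in a monosyllabic phoneme system.
ph_seq = ["SP", "n", "i", "h", "ao", "SP"]
print(infer_ph_num(ph_seq, vowels={"i", "ao"}))  # [2, 2, 1, 1]
```

Under this rule a polysyllabic word would open a group at each of its vowels, including vowels that are not onset phones, which is exactly the case that the phone sequence alone cannot resolve.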

However, the onset phones can still be inferred if suitable extra information is provided. Some useful information may be:

Also, with a clean implementation of this idea, we may be able to unify the algorithm across all universal dictionaries, regardless of their languages and phoneme system categories.