[Help]: MaskGCT Phoneme Support

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

https://openhlt.github.io/amphion/

MIT License

7.78k stars 589 forks source link

[Help]: MaskGCT Phoneme Support #287

Open jd3655 opened 1 month ago

jd3655 commented 1 month ago

Hello,

Thank you for releasing the MaskGCT project. The sound quality is great.

In the english model, can you pass phonemes directly to the model for pronunciation?

Thank you

sankar-mukherjee commented 3 weeks ago

Is there any update on this?