openvpi / DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Apache License 2.0
acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

DiffSinger (OpenVPI maintained version)

This is a refactored and enhanced version of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism, based on the original paper and implementation.

Architecture diagrams: Overview, Variance Model, Acoustic Model

User Guidance

中文教程 / Chinese Tutorials: Text, Video

Progress & Roadmap

Architecture & Algorithms

TBD

Development Resources

TBD

References

Original Paper & Implementation

Generative Models & Algorithms

Dependencies & Submodules

Disclaimer

Any organization or individual is prohibited from using any functionality included in this repository to generate someone's speech without their consent, including but not limited to government leaders, political figures, and celebrities. If you do not comply with this condition, you could be in violation of copyright laws.

License

This forked DiffSinger repository is licensed under the Apache 2.0 License.