open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
https://openhlt.github.io/amphion/
MIT License
4.45k stars 379 forks source link

Add PicoAudio Model #249

Open zeyuxie29 opened 1 month ago

zeyuxie29 commented 1 month ago

✨ Description

The PR adds the PicoAudio into the Amphion toolkit.

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

🚧 Related Issues

[List the issue numbers related to this PR]

👨‍💻 Changes Proposed

🧑‍🤝‍🧑 Who Can Review?

@zhizhengwu @HeCheng0625

🛠 TODO

✅ Checklist