open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
https://openhlt.github.io/amphion/
MIT License
4.28k stars, 365 forks

Release Emilia and Emilia-Pipe #227

yuantuo666 closed 1 week ago

yuantuo666 commented 2 weeks ago

✨ Description

We release Emilia, an extensive, multilingual, and diverse dataset, and Emilia-Pipe, the first open-source preprocessing pipeline designed to transform in-the-wild speech data into high-quality, annotated training data for speech generation.

Major contributors to this PR: @HarryHe11 @shangqwe123 @yuantuo666 @lixuyuan102

🚧 Related Issues

None

👨‍💻 Changes Proposed

🧑‍🤝‍🧑 Who Can Review?

@HarryHe11 @jiaqili3 @RMSnow @HeCheng0625

🛠 TODO

None

✅ Checklist

HarryHe11 commented 2 weeks ago

Hi Chaoren @yuantuo666, thank you so much for raising this PR and for all of your efforts, @yuantuo666 @shangqwe123 @lixuyuan102!

I notice that we haven't provided the list of source audio files in this PR yet. Maybe we could complete this part before merging!

HarryHe11 commented 2 weeks ago

@RMSnow Hi Xueyao, thank you so much for your helpful suggestions! I have addressed all of your comments accordingly!

HarryHe11 commented 1 week ago

@yuantuo666 @shangqwe123 @lixuyuan102 Chaoren, Zengqiang, Xuyuan,

I think we'd better merge this PR after we get our arXiv link and finish all of the following tasks.