open-mmlab / Amphion

Amphion (/Γ¦mˈfaΙͺΙ™n/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
https://openhlt.github.io/amphion/
MIT License
4.28k stars 365 forks source link

Add UniCATS implementation #202

Open cyysky2 opened 2 months ago

cyysky2 commented 2 months ago

✨ Description

The PR adds the UniCATS into the Amphion toolkit.

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding, which is capable of both speech continuation and editing.

The PR is the Final project for AIR6063. This is a solo project by ηŽ‹ζ΄‹ηΊ (Wang Yangjun), ID: 119010315

πŸ‘¨β€πŸ’» Changes Proposed

πŸ§‘β€πŸ€β€πŸ§‘ Who Can Review?

@zhizhengwu @Adorable-Qin @HeCheng0625

πŸ›  TODO

βœ… Checklist

lmxue commented 2 months ago

Please format the code using 'black' formatter as described here.

Please provide the final checkpoints and samples.

Additionally, the commit information should be concise.