Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding, which is capable of both speech continuation and editing.
✨ Description
This PR adds UniCATS to the Amphion toolkit.
UniCATS repo: https://github.com/cpdu/unicats
UniCATS paper: https://arxiv.org/abs/2306.07547
UniCATS demo page: https://cpdu.github.io/unicats/
This PR is the final project for AIR6063. It is a solo project by Wang Yangjun (ID: 119010315).
👨‍💻 Changes Proposed
🧑‍🤝‍🧑 Who Can Review?
@zhizhengwu @Adorable-Qin @HeCheng0625
🛠 TODO
✅ Checklist