open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
https://openhlt.github.io/amphion/
MIT License
4.41k stars 373 forks source link

Add TransformerVC & VITSVC implementation #183

Open ArkhamImp opened 4 months ago

ArkhamImp commented 4 months ago

✨ Description

VC implementations with Transformer and VITS model architectures. Samples refer to the previous PR: https://github.com/open-mmlab/Amphion/pull/90

👨‍💻 Changes Proposed

🧑‍🤝‍🧑 Who Can Review?

@lmxue @RMSnow

✅ Checklist

RMSnow commented 4 months ago

Hi @ArkhamImp, Haorui @HarryHe11 and I are also developing the zero-shot VC/SVC. We will leave this PR to review when we find a unified way to integrate all our code including yours!