open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
https://openhlt.github.io/amphion/
MIT License
4.45k stars 381 forks source link

Implement VitsSVC resume training / finetune feature #95

Closed viewfinder-annn closed 8 months ago

viewfinder-annn commented 8 months ago

✨ Description:

This PR addresses bugs in VitsSVC module, mainly related to:

  1. implement resume training / finetune feature
  2. fix bug for multiple acoustic content training

🚧 Related Issues:

#91

👨‍💻 Changes Proposed:

🧑‍🤝‍🧑 Who can review?

@RMSnow @lmxue

🛠 TODO

✅ Checklist:

viewfinder-annn commented 8 months ago

Just fixed problem 1, now this PR doesn't affect tasks other than SVC.

viewfinder-annn commented 8 months ago

Now the default training data config is the same across DiffWaveNet and VitsSVC. The exp_config.json is now user-friendly with fewer exposed options. I'll refractor SVC preprocess to svc.json in another PR soon.