Closed hansheng-zhang closed 2 days ago
Thanks for the contribution! Some questions:
- I notice that you added new MSD and MPD implementations for JETS instead of using the existing ones. Is it possible to reuse the existing discriminators to improve readability?
- If any of the code is adapted from other repos, please make sure to add acknowledgements in the README and at the top of each affected file.
- Demos of your reproduction would be welcome.
✨ Description
We release the JETS (Jointly Training FastSpeech2 and HiFi-GAN for End-to-End Text-to-Speech) model in Amphion. JETS has a simplified training pipeline and outperforms a cascade of separately trained models. Specifically, JETS trains FastSpeech2 and HiFi-GAN jointly, together with an alignment module.
How to test: see egs/Jets/README.md
Major contributors to this PR: @hansheng-zhang @chenjianzhen666 @So1a
👨‍💻 Changes Proposed
🧑‍🤝‍🧑 Who Can Review?
@lmxue @RMSnow
✅ Checklist