fishaudio / fish-diffusion

An easy to understand TTS / SVS / SVC framework
https://diff.fish.audio
MIT License
635 stars 81 forks source link

Feature: add Auto Vocoder #57

Closed leng-yue closed 1 year ago

leng-yue commented 1 year ago

This branch aims to integrate a modified VAE into fish-diffusion. It will help us to replace mel spectrogram with an embedding and thus improve the robustness of diffusion-based svc and svs tasks.

Note: Auto Vocoder repo is not used AT ALL. It can't create any helpful representation.