rendchevi / nix-tts

🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation
MIT License
233 stars 31 forks source link

Discussion: How about distil MAS result from teacher VITS to replace the Text Aligner? #10

Open TinaChen95 opened 1 year ago

TinaChen95 commented 1 year ago

I found that VITS's MAS result is very accurate, so why not distil the duration information to train the student model?