kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.
http://kaldi-asr.org

[NEW RESOURCE] Introducing MARS5, an open-source, insanely prosodic text-to-speech (TTS) model. #4918

Open akshhack opened 1 month ago

akshhack commented 1 month ago

Hey community members! 👋

We are super stoked to announce the open-source release of MARS5, a new speech emulation model that can replicate even extremely tough prosody, such as sports commentary, anime, and movies, from just a few seconds of reference audio.

Check out our release: https://github.com/Camb-ai/MARS5-TTS

Watch our demo here:

https://github.com/kaldi-asr/kaldi/assets/21692676/50f0fe56-d1e0-42d9-b1e5-e14396605ac0

and the full release video: https://www.youtube.com/watch?v=bmJSLPYrKtE

We're excited to hear feedback and see the community build on top of it!

Quick links:
Discord: https://discord.gg/ZzsKTAKM
GitHub: https://www.github.com/camb-ai/mars5-tts
Website: https://www.camb.ai/
YouTube: https://www.youtube.com/@camb-ai