SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
https://arxiv.org/abs/2410.06885
MIT License
7.48k stars 924 forks

How to customize voices? #430

Closed: ldgoooo closed this issue 2 weeks ago

ldgoooo commented 2 weeks ago

Checks

I have thoroughly reviewed the project documentation and read the related paper(s).

Question details

Does it support custom parameters such as tone, pitch, or a softened voice?

SWivid commented 2 weeks ago

You could use ref_audio to control this.
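
To make the ref_audio suggestion concrete, here is a minimal sketch of one way to apply it: keep one reference clip per speaking style and pass the matching clip when generating. Everything named here is an assumption, not something confirmed in this thread: the clip paths and transcripts are placeholders, `build_infer_command` is a hypothetical helper, and the `f5-tts_infer-cli` entry point with its `--ref_audio`, `--ref_text`, and `--gen_text` flags should be checked against the project README for your installed version. The sketch only builds the command line; it does not run inference.

```python
import shlex

# Hypothetical reference clips, one per speaking style. The paths and
# transcripts are placeholders; record or pick your own clips.
REFERENCE_CLIPS = {
    "neutral": ("refs/neutral.wav", "This is my normal speaking voice."),
    "soft": ("refs/soft.wav", "This is my quiet, gentle voice."),
}


def build_infer_command(style, gen_text, clips=REFERENCE_CLIPS):
    """Return an argv list that uses the reference clip registered for `style`."""
    if style not in clips:
        raise KeyError(f"no reference clip registered for style {style!r}")
    ref_audio, ref_text = clips[style]
    # Assumed CLI entry point and flag names; verify them against the README.
    return [
        "f5-tts_infer-cli",
        "--ref_audio", ref_audio,
        "--ref_text", ref_text,
        "--gen_text", gen_text,
    ]


# Build (but do not execute) a command that asks for the "soft" delivery.
cmd = build_infer_command("soft", "Good night, sleep well.")
print(shlex.join(cmd))
```

Under these assumptions there are no separate tone or pitch knobs in the sketch: the style comes entirely from which clip is passed as the reference. If the flags match your install, the returned list could be handed to `subprocess.run`.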