SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
https://arxiv.org/abs/2410.06885
MIT License
6.76k stars, 786 forks

French model released #434

Open RASPIAUDIO opened 4 hours ago

RASPIAUDIO commented 4 hours ago

Question details

Hi, please find an improvised tutorial on how to train F5-TTS on a new language using only the Gradio web interface:

https://www.youtube.com/watch?v=UO4usaOojys

You will also find the link to the French model I trained on 80k samples for 100 epochs with a single speaker. The Google Drive link is in the video comment, and the training material used is in the video description. Please subscribe to my channel to support!

My impression is that the more I train, the less sensitive the model becomes to the reference sample I want to clone, and the output moves closer to the training samples. Any idea how to overcome this?

SWivid commented 3 hours ago

Hi @RASPIAUDIO, you will need a more diverse training corpus. This behavior is normal when training on only a single speaker.
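
As a rough way to check how skewed a corpus is toward one voice, here is a minimal sketch. It assumes each training clip carries a speaker label. The `spk_*` IDs, the function names, and the 0.5 threshold are made up for illustration and are not part of F5-TTS.

```python
from collections import Counter


def speaker_shares(speaker_ids):
    """Return each speaker's fraction of the total sample count."""
    counts = Counter(speaker_ids)
    total = sum(counts.values())
    return {spk: n / total for spk, n in counts.items()}


def dominant_speakers(speaker_ids, max_share=0.5):
    """List speakers whose share of the samples exceeds max_share."""
    shares = speaker_shares(speaker_ids)
    return sorted(spk for spk, share in shares.items() if share > max_share)


# Hypothetical labels: one speaker ID per training clip.
labels = ["spk_a"] * 8 + ["spk_b"] + ["spk_c"]
print(speaker_shares(labels))
print(dominant_speakers(labels))
```

If one speaker accounts for most of the clips, the model is likely to drift toward that voice, which matches the behavior described in the question.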

RASPIAUDIO commented 3 hours ago

Thanks, I will try to supplement the training with many different speakers.