JarodMica / audiobook_maker

GNU General Public License v3.0
293 stars 47 forks source link

V3.0 End of Words At End of Sentences Cut Off #69

Open FJCCOMMISH opened 5 days ago

FJCCOMMISH commented 5 days ago

Other than adding an extra word, then editing the audio, is there a way to avoid cutting off the last sounds of the last word in a sentence?

For example this sentence: "This is the book for you if you want to explore the history of democracy and its shaping forces."

It is read output as this: "This is the book for you if you want to explore the history of democracy and its shaping force." The ending s sound is missing.

This happens often, with many sentences.

JarodMica commented 5 days ago

I believe this isn't an issue with the processing of the file, but an unfortunate issue with tortoise TTS. It can be mitigated through training, but I'm not sure of your comfortablilty of training tortoise TTS models. I'm almost done with the styletts API and it's much more accurate at pieces like that, and then even more so, should be adding F5TTS soon as well so look forward to both of those additions!

edbartz commented 5 days ago

In v2 of the audiobook maker, I saw some improvement with this issue in tortoise by adding a space before the period at the end of sentences. I hope this helps.

FJCCOMMISH commented 18 hours ago

In v2 of the audiobook maker, I saw some improvement with this issue in tortoise by adding a space before the period at the end of sentences. I hope this helps.

Yes, that did address the dropped last sound, but not at the end of the final sentence. It voices that one, but cuts it off.

FJCCOMMISH commented 18 hours ago

I believe this isn't an issue with the processing of the file, but an unfortunate issue with tortoise TTS. It can be mitigated through training, but I'm not sure of your comfortablilty of training tortoise TTS models. I'm almost done with the styletts API and it's much more accurate at pieces like that, and then even more so, should be adding F5TTS soon as well so look forward to both of those additions!

I train with Tortoise and RVC. What can be done in training to address the cutoff?