fishaudio / fish-speech

Brand new TTS solution
https://speech.fish.audio
Other
14.62k stars 1.11k forks source link

Review generated chunks before merging #655

Open GalenMarek14 opened 2 weeks ago

GalenMarek14 commented 2 weeks ago

Self Checks

1. Is this request related to a challenge you're experiencing? Tell us your story.

When generating long texts, sometimes only a small part comes out poorly. We have to either regenerate the entire text or manually edit the bad part.

2. What is your suggested solution?

I suggest reviewing each generated chunk of audio before final merging, regenerating problematic parts, and merging with optional silence between chunks (like the ability to insert milliseconds/seconds of silences between chunks before merging). This could be done with a feature like (plus silence insertion): textgen From alltalk_tts

Additionally, optional chunking method of splitting the text on every new paragraph would be good too.

An alternate, easier method would be to allow batch generation and save each output as a separate file, either by saving each paragraph as a different file or by generating from a folder of TXT files.

3. Additional context or comments

Thank you very much for this project.

4. Can you help us with this feature?