erew123 / alltalk_tts

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
GNU Affero General Public License v3.0
864 stars 98 forks source link

Standard TTS generation example from the documentation does not work #293

Closed waan1 closed 1 month ago

waan1 commented 1 month ago

Describe the bug Narrator Example example works well, but Standard TTS generation example fails.

To Reproduce Steps to reproduce the behavior: The configuration is exactly the same as in my previous issue report - problem with Polish language #292

curl -X POST "http://127.0.0.1:7851/api/tts-generate" -d "text_input=All of this is text spoken by the character. This is text not inside quotes, though that doesnt matter in the slightest" -d "text_filtering=standard" -d "character_voice_gen=female_01.wav" -d "narrator_enabled=false" -d "narrator_voice_gen=male_01.wav" -d "text_not_inside=character" -d "language=en" -d "output_file_name=myoutputfile" -d "output_file_timestamp=true" -d "autoplay=true" -d "autoplay_volume=0.8" {"status":"generate-failure","error":"An error occurred"}%

[AllTalk TTSGen] 2.58 seconds. LowVRAM: False DeepSpeed: False [AllTalk TTSGen] All of this is text spoken by the character. This is text not inside quotes, though that doesnt matter in the slightest

Screenshots If applicable, add screenshots to help explain your problem.

Text/logs If applicable, copy/paste in your logs here from the console.

Desktop (please complete the following information): AllTalk was updated: [approx. date] Custom Python environment: [yes/no give details if yes] Text-generation-webUI was updated: [approx. date]

Additional context Add any other context about the problem here.

waan1 commented 1 month ago

I see that it fails when narrator_enabled is not true Also if autoplay_volume is not defined even if autoplay=false

erew123 commented 1 month ago

Hi @waan1

Not sure I can re-create the fault you're describing. I have copied/pasted the exact CURL command from above, I have also altered the narrator_enabled and the autoplay:

image

All works fine.

Ive had no other complaints or issues like this before, so Im a little lost as to what to suggest could be the exact problem. What I could suggest is moving to the V2 beta as there is extra levels of debugging available in that one, plus there are very beneficial changes to the API that simplify generation.

https://github.com/erew123/alltalk_tts/tree/alltalkbeta

erew123 commented 1 month ago

Hi @waan1 If you need more information or to discuss this more, please reply back here.

Thanks

waan1 commented 1 month ago

It still did not work. Maybe because of difference between windows and linux (manjaro). I'm going to install beta version and check if it is going to work fine.