matatonic / openedai-speech

An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
GNU Affero General Public License v3.0
192 stars 32 forks source link

Streaming is unavailable when invoking the API #10

Closed wangtao201919 closed 4 weeks ago

wangtao201919 commented 1 month ago

Thank you for this great project. When I use xtts as the speech engine and try to stream audio playback, I observe that the code outputs the audio to the player only after the entire audio synthesis is completed. However, when I directly call OpenAI's TTS API, I can play the audio during the synthesis process.

matatonic commented 1 month ago

This is a known issue and planned for a future release.