collabora / WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.
https://collabora.github.io/WhisperSpeech/
MIT License
3.8k stars 207 forks

new example for "text_to_playback" method #113

Closed. BBC-Esq closed this 6 months ago

BBC-Esq commented 7 months ago

Uses the newly accepted text_to_playback method in pipeline.py, which copies the "generate" method but also plays the audio at the end (using sounddevice).
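
For anyone unfamiliar with the pattern, here is a minimal sketch of "generate everything, then play at the end". It is not the code from pipeline.py: `generate_then_play` is a hypothetical helper, the 24000 Hz sample rate is an assumption, and the generator is passed in as a plain callable so the sketch does not rely on the exact Pipeline signature. The only sounddevice calls it uses are `sd.play` and `sd.wait`.

```python
def generate_then_play(generate, text, samplerate=24000, play=None, wait=None):
    """Synthesize all of `text` in one call, then play the result once.

    `generate` is any callable mapping text to a 1-D sequence of samples
    (a hypothetical stand-in for the pipeline's "generate" method).
    `samplerate` is an assumed value; use whatever your model outputs.
    """
    audio = generate(text)  # the whole utterance is produced before playback

    if play is None:
        # Imported lazily so the helper can be exercised without audio hardware.
        import sounddevice as sd
        play, wait = sd.play, sd.wait

    play(audio, samplerate)  # starts playback and returns immediately
    if wait is not None:
        wait()               # block until playback has finished
    return audio
```

If the generator returns a tensor rather than an array, it will probably need converting to a CPU NumPy array before it is handed to sounddevice.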

Any feedback is welcome. I'm thinking of renaming the examples in this folder to make them shorter and more descriptive. It's getting a little confusing: the new "generate_to_playback" method isn't suited to processing and playing back audio sentence by sentence, which a few other examples do by calling the "generate" method and using sounddevice directly. I think I need to clarify the naming conventions, so any suggestions are welcome.
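
To make the distinction concrete, here is a rough sketch of the sentence-by-sentence approach. It is an illustration under assumptions, not one of the existing examples: `split_sentences` and `play_by_sentence` are hypothetical names, the regex splitter is deliberately naive, and the 24000 Hz sample rate is assumed. The generator and playback functions are passed in as callables.

```python
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def play_by_sentence(generate, text, play, wait, samplerate=24000):
    """Generate and play one sentence at a time; return the sentences played.

    `generate` maps a sentence to audio samples; `play(audio, samplerate)`
    and `wait()` are expected to behave like sounddevice's play and wait.
    """
    sentences = split_sentences(text)
    for sentence in sentences:
        audio = generate(sentence)  # only this sentence is synthesized
        play(audio, samplerate)     # start playback of the sentence
        wait()                      # block until it finishes, then continue
    return sentences
```

With the real libraries this might be called as `play_by_sentence(pipe.generate, text, sd.play, sd.wait)`, assuming `pipe` is a Pipeline instance and its output is in a form sounddevice accepts. The first audio starts after one sentence has been generated rather than after the whole text, but this simple version does not overlap generation with playback.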

I also need to make a minor update to the instructions, since "soundfile" is not installed automatically. I'll do separate pull requests for that, though.