[WIP] Add get_voices, ElevenLabs, Refactor synth, speak method, streaming and various bug fixes

willwade commented 1 year ago

This is a monster, mega, possibly too large a PR.

the key aim is NOT to break functionality in the original wrapper. But we have added to it a LOT. Any changes now we are just going to bug fix rather than look at new features..

I've tried SUPER hard to not bring in any breaking changes. But please test!!

Types of changes

[x] Bug fix (non-breaking change which fixes an issue)
[x] New feature (non-breaking change which adds functionality)
[x] Breaking change (fix or feature that would cause existing functionality to change)
[x] I have read the CONTRIBUTING document.
[x] My code follows the code style of this project.
[x] My change requires a change to the documentation.
[x] I have updated the documentation accordingly.
[x] I have added tests to cover my changes.
[x] All new and existing tests passed.

willwade commented 6 months ago

Note; there is something really annoying about the synth method that is documented. Im not sure im dealing with it right

In the docs it states

tts.synth('<speak>Hello, world!</speak>', 'hello.mp3', format='mp3)

this actually isnt possible. I think it might have been a typo of sorts because i think its meant to be

tts.synth_to_file('<speak>Hello, world!</speak>', 'hello.mp3', format='mp3)

So - what I've done is made a synth method to use like it was documented in abstract

    def synth(self, text: str, filename: str, format: Optional[FileFormat] = "wav"):
        """
        Synthesizes text to speech and directly saves it to a file. Alias
        """
        self.synth_to_file(text,filename,format)

I'll be honest this grates the hell out of me. Because of course most of the engines have a synth method themselves. I fear about confusion. Its one of the reasons I have introduced speak and speak_streamed to help move away from this. But thoughts welcome

willwade commented 3 months ago

Sorry - totally forgot I left this PR open. Closing it - If I was mediatechlab I wouldnt accept this PR as its too massive. Drop me a line if you want to pick it back up though..

mediatechlab / tts-wrapper

[WIP] Add get_voices, ElevenLabs, Refactor synth, speak method, streaming and various bug fixes #25

Types of changes