ManimCommunity / manim-voiceover

Manim plugin for all things voiceover
https://voiceover.manim.community/en/stable
MIT License
186 stars 25 forks source link

Simplify CoquiService #51

Closed osolmaz closed 1 year ago

osolmaz commented 1 year ago

Coqui has introduced a Python API for TTS since it was added to Manim Voiceover. This means that the workaround implemented to call it can be removed and the service simplified.

This also removes the word boundary generation logic that was previously implemented, but we have to do it anyway since TTS has been refactored and its latest version doesn't work with the workaround. We can fall back to word boundary detection with Whisper instead: CoquiService(transcription_model='base').