AwesomeTTS / awesometts-anki-addon

AwesomeTTS text-to-speech add-on for Anki
GNU General Public License v3.0

Addition of voicevox support (Japanese) #282

Closed by 0xspringtime 1 year ago

0xspringtime commented 1 year ago

I was considering writing a new add-on, using AwesomeTTS as the template, to generate TTS with VOICEVOX, since we now have a great API wrapper: https://github.com/tuna2134/voicevox-client (docs: https://voicevox-client.tuna2134.jp/). However, I was wondering whether it would be easier to just add it as a feature here. One impediment may be the need for voicevox_engine as a dependency. Is anyone else interested in this feature? I already know the demand is there in the Japanese Anki community.

luc-vocab commented 1 year ago

Can you describe VOICEVOX in more detail? Is it a free TTS engine? Is it easy to install?

0xspringtime commented 1 year ago

Yes, VOICEVOX is a free and open-source TTS engine with 20+ premade voices. It seems to reproduce things like pitch accent better than the paid Japanese TTS models, and it has caught on enough to be used for vocaloids. It has a cross-platform frontend that seems simple to install and even has an AUR release; the only problem is that all the documentation is in Japanese. https://github.com/VOICEVOX https://voicevox.hiroshiba.jp/product/amehare_hau/

luc-vocab commented 1 year ago

Is your proficiency in Japanese sufficient to navigate the documentation?

0xspringtime commented 1 year ago

Probably not on my own, but I haven't determined how much of it would be necessary, since tuna's wrapper documentation is in English and seems sufficient for integration. The problem I can see with integrating it here is that it still relies on voicevox_engine as a dependency, and I know that becomes cumbersome with Anki, since new modules have to be bundled in. So unless developers more familiar with the codebase than I am think it's a good idea, I don't want to push a feature aimed specifically at the Japanese-learning niche onto an established product if it could be hard to maintain.
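
For reference, here is a rough sketch of what talking to a locally running voicevox_engine could look like without bundling any extra Python modules. It assumes the engine is already started separately and listening on its default local port (50021), and that it exposes the two-step `audio_query` then `synthesis` HTTP flow. The helper names (`build_query_url`, `build_synthesis_url`, `synthesize`) and the speaker ID are made up for illustration; this is not how tuna's wrapper or AwesomeTTS does it.

```python
import urllib.parse
import urllib.request

# Assumption: a VOICEVOX engine is running locally on its default port.
ENGINE_URL = "http://127.0.0.1:50021"


def build_query_url(text, speaker, base=ENGINE_URL):
    """Build the URL for the audio_query step (text -> synthesis parameters)."""
    params = urllib.parse.urlencode({"text": text, "speaker": speaker})
    return f"{base}/audio_query?{params}"


def build_synthesis_url(speaker, base=ENGINE_URL):
    """Build the URL for the synthesis step (parameters -> WAV audio)."""
    params = urllib.parse.urlencode({"speaker": speaker})
    return f"{base}/synthesis?{params}"


def synthesize(text, speaker=1, base=ENGINE_URL, timeout=30):
    """Return WAV bytes for `text`. Requires a running engine (assumption)."""
    # Step 1: ask the engine for a JSON audio query describing the utterance.
    request = urllib.request.Request(
        build_query_url(text, speaker, base), data=b"", method="POST"
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        audio_query = response.read()

    # Step 2: send that JSON back unchanged to get the synthesized audio.
    request = urllib.request.Request(
        build_synthesis_url(speaker, base),
        data=audio_query,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

With the engine running, the bytes returned by `synthesize("こんにちは")` could be written to a `.wav` file and attached to a card. Using only the standard library sidesteps the module-bundling problem, but the engine itself still has to be installed and started by the user.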

luc-vocab commented 1 year ago

OK, and how does the voice quality compare to neural voices such as Azure or Google?

0xspringtime commented 1 year ago

IMO much less robotic, with more variety, along with the more accurate pitch accent mentioned before.

0xspringtime commented 1 year ago

I was able to implement this feature using a different add-on; details are here: https://github.com/0xspringtime/voicevox-anki (so I'm going to close this as completed).