rhasspy / piper

A fast, local neural text to speech system
https://rhasspy.github.io/piper-samples/
MIT License
5.74k stars, 412 forks

Kurdish model needed #498

Open willwade opened 4 months ago

willwade commented 4 months ago

We desperately need Kurdish TTS to support people without speech. No TTS system currently supports Kurdish apart from eSpeak.

I'm really parking this here as a note to anyone else wanting to do this too. Here are some datasets

(sort of related:

synesthesiam commented 3 months ago

The next version of Piper will be moving away from espeak (for licensing reasons), so I will need a pronunciation dictionary for Kurdish similar to these: https://mfa-models.readthedocs.io/en/latest/dictionary/index.html#dictionary

The dictionary can be generated with espeak-ng too, though the quality depends on aspects of the language itself. Arabic is especially difficult because the written form can be spoken many different ways depending on context.
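As a rough illustration of the espeak-ng route mentioned above, here is a minimal sketch that turns a word list into dictionary lines of the form `word<TAB>phone phone ...`, which is the general layout of the MFA dictionaries linked earlier. Several things here are assumptions rather than anything confirmed in this thread: that `espeak-ng` is on the PATH, that its `ku` voice is the right one for the target Kurdish variety, that `--sep` with a space gives usable phone boundaries, and the helper names `espeak_ipa` and `build_dictionary`, which are made up for this example.

```python
import subprocess
from typing import Callable, Iterable


def espeak_ipa(word: str, voice: str = "ku") -> str:
    """Ask espeak-ng for an IPA transcription of a single word.

    Assumes espeak-ng is installed; "ku" is an assumed voice name.
    -q suppresses audio, --ipa prints IPA, --sep sets the phoneme separator.
    """
    result = subprocess.run(
        ["espeak-ng", "-q", "--ipa", "--sep= ", "-v", voice, word],
        capture_output=True,
        text=True,
        check=True,
    )
    # Collapse any newlines or repeated spaces into single spaces.
    return " ".join(result.stdout.split())


def build_dictionary(
    words: Iterable[str], phonemize: Callable[[str], str]
) -> list[str]:
    """Return sorted, de-duplicated 'word<TAB>phones' lines.

    Words whose transcription comes back empty are skipped.
    """
    lines = []
    for word in sorted(set(words)):
        phones = phonemize(word).strip()
        if phones:
            lines.append(f"{word}\t{phones}")
    return lines
```

Calling `build_dictionary(word_list, espeak_ipa)` would produce the lines to write to a dictionary file. The phonemizer is passed in as a function so the espeak-ng call can be swapped for a hand-corrected lookup later. The output would still need review by a native speaker, since, as noted above, quality depends on the language.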

willwade commented 3 months ago

Interesting. Any info on the licensing issues with espeak?

synesthesiam commented 3 months ago

espeak and its successor espeak-ng are GPL licensed (espeak-ng is GPL-3.0-or-later). Depending on who you ask, this means that any project using it should also be some form of GPL. Since Piper is MIT licensed, I want to make sure there's no question that it can be used everywhere.