grammatek / simaromur

Icelandic TTS (text-to-speech) service for Android
Apache License 2.0
8 stars 2 forks source link

WIP: V1.4.x #151

Closed lumpidu closed 7 months ago

lumpidu commented 7 months ago

This branch adds the new AI voice Steinn to Símarómur

This VITS voice Steinn is trained on 16kHz sample rate converted Talrómur voice H (i.e. Steinn) via Piper TTS and uses IPA phonemization. The model inputs are adapted with the appropriate phoneme conversions, like padding every symbol with 0, adding BOS, EOS, etc.

In contrast to the earlier voices, this model is trained on commas, question marks, exclamation marks, dots at the end of sentences and also on tagged silences of the training set. The latter can be controlled at inference time by adding # at the appropriate position of an utterance.

The resulting voice performance is quite good and shows very good runtime performance.

The corresponding voice repository will be updated accordingly.