dmort27 / epitran

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

MIT License

649 stars 123 forks source link

First of all, thanks for making this great software. Works perfect for me. Also adding rules is explained very clearly and I could implement it with ease.

I am parsing, then converting a dutch wordlist to ipa and xsampa, trying to generate a dict for building voices. I saw there's a arpabet mapping too, which would be handy training sphinx. Should I create a class, and ipa2arpa.csv like you did for the xsampa conversion?

I am now using xsampa like this:

`from epitran.xsampa import XSampa

set to dutch

epi = epitran.Epitran('nld-Latn')

x-sampa class

xs = XSampa()

s = epi.transliterate( word ).encode("utf-8") s_a = xs.ipa2xs( unicode(s, "utf-8") ) ` So I could also make a class like xsampa for ipa2arpa, or there is a simpler way?

dmort27 / epitran

question - Method for Arpabet conversion? #8

set to dutch

x-sampa class