Closed mashabelyi closed 4 years ago
Sorry for the late response. The short answer is that Epitran was not designed to do what you want (extract phoneme-grapheme alignments). The behavior you describe is expected: these methods were added with a very specific application in mind, one that did not require accurate alignments between the two representations, only some alignment. Perhaps this code should be removed. In any case, because of its architecture, Epitran will only get you part of the way to phoneme-grapheme alignments (it gives you the phonemic representations). You must do the rest with an aligner.
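To make "do the rest with an aligner" a bit more concrete, here is a minimal sketch of a monotonic grapheme-to-phoneme aligner using dynamic programming. It is not part of Epitran and does not call Epitran at all: the `align` function, the `TOY_SCORES` table, and the penalty constants are all hypothetical, and the phoneme lists are hand-written rather than taken from Epitran output. A real aligner would learn the scores from data instead of using a hand-filled table.

```python
# Hypothetical toy scores for (grapheme chunk, phoneme) pairs.
# These are hand-picked for illustration, not learned and not from Epitran.
TOY_SCORES = {
    ("t", "t"): 1.0,
    ("ou", "ʌ"): 1.0,
    ("gh", "f"): 1.0,
    ("ch", "tʃ"): 1.0,
    ("oi", "ɔɪ"): 1.0,
    ("c", "s"): 1.0,
}
UNKNOWN = -1.0  # assumed penalty for a pair missing from the table
SILENT = -0.5   # assumed penalty for a grapheme mapped to no phoneme


def align(graphemes, phonemes, scores=TOY_SCORES, max_chunk=2):
    """Return the best-scoring list of (grapheme chunk, phoneme) pairs.

    Each phoneme is matched to a chunk of 1..max_chunk graphemes, and a
    single grapheme may be left silent (paired with the empty string).
    """
    n, m = len(graphemes), len(phonemes)
    neg_inf = float("-inf")
    # best[i][j]: best score aligning the first i graphemes to the first j phonemes
    best = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    best[0][0] = 0.0

    for i in range(n + 1):
        for j in range(m + 1):
            if best[i][j] == neg_inf:
                continue
            # Option 1: treat the next grapheme as silent.
            if i < n and best[i][j] + SILENT > best[i + 1][j]:
                best[i + 1][j] = best[i][j] + SILENT
                back[i + 1][j] = (i, j)
            # Option 2: map the next k graphemes to the next phoneme.
            if j < m:
                for k in range(1, max_chunk + 1):
                    if i + k > n:
                        break
                    pair = (graphemes[i:i + k], phonemes[j])
                    score = best[i][j] + scores.get(pair, UNKNOWN)
                    if score > best[i + k][j + 1]:
                        best[i + k][j + 1] = score
                        back[i + k][j + 1] = (i, j)

    if best[n][m] == neg_inf:
        raise ValueError("no alignment possible (more phonemes than graphemes?)")

    # Walk the back-pointers from the end to recover the alignment.
    pairs = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        pairs.append((graphemes[pi:i], phonemes[pj] if pj < j else ""))
        i, j = pi, pj
    return pairs[::-1]


print(align("tough", ["t", "ʌ", "f"]))
print(align("choice", ["tʃ", "ɔɪ", "s"]))
```

With this toy table, `gh` is paired with /f/ in "tough" and `c` with /s/ in "choice", with the final `e` left silent. Counting such pairs over a word list would give the alignment frequencies asked about, under the assumption that the score table is good enough for the language in question.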
Got it, thanks for your response.
Thank you for this great tool! I was hoping to use Epitran to extract frequencies of grapheme-phoneme alignment in different languages. But I am running into issues when using the `word_to_tuples` and `word_to_segs` features.

Here is the output of `epi.word_to_tuples` for the word `tough` in English

Here is the output for `choice`

I'd expect the phonetic form `/f/` in `tough` to correspond to either `g` or `h`. And the phonetic form `/s/` in `choice` to correspond to `c`. However, that's not the case. I am wondering if this is expected behavior or a bug?