bootphon / phonemizer

Simple text to phones converter for multiple languages
https://bootphon.github.io/phonemizer/
GNU General Public License v3.0

Two phones in Arabic are mapped to the same symbol #171

LamOne1 closed 1 month ago

LamOne1 commented 1 month ago

I noticed that "ظ" (ðˤ) and "ذ" (ð) are both mapped to "ð":

from phonemizer.backend import BACKENDS
from phonemizer.separator import Separator
g2p = BACKENDS['espeak'](language='ar', words_mismatch='warn', preserve_punctuation=False)
g2p.phonemize(["ظ"], separator=Separator(phone=None, word=' ', syllable='|'), strip=True)

# -----> ðaaʔ

g2p.phonemize(["ذ"], separator=Separator(phone=None, word=' ', syllable='|'), strip=True)

# -----> ðaal

Packages: phonemizer 3.2.1, Python 3.8.19

mmmaat commented 1 month ago

Hi, this comes from the espeak backend, not from phonemizer, so there is nothing we can do here. You may want to open an issue in the espeak-ng repo. Calling espeak-ng directly gives the same phones:

$ espeak-ng -q -x --ipa -v ar "ظ"
ðˈaaʔ
$ espeak-ng -q -x --ipa -v ar "ذ"
ðˈaal
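
For anyone who wants to check other letters before filing the espeak-ng issue, here is a minimal sketch of a comparison helper. The function name `find_unexpected_outputs` and the `expected` table are hypothetical and not part of phonemizer or espeak-ng. The `reported` dictionary only replays the two outputs quoted in this thread, so the snippet runs without espeak installed.

```python
from typing import Callable, Dict


def find_unexpected_outputs(
    expected: Dict[str, str],
    phonemize: Callable[[str], str],
) -> Dict[str, str]:
    """Return the letters whose phonemized output does not start with the expected phone.

    `expected` maps a letter to the IPA phone it should begin with.
    `phonemize` is any callable that turns one letter into a phone string.
    """
    mismatches = {}
    for letter, phone in expected.items():
        output = phonemize(letter)
        if not output.startswith(phone):
            mismatches[letter] = output
    return mismatches


# Expected initial phones, as stated in the original report (an assumption of this sketch).
expected = {"ظ": "ðˤ", "ذ": "ð"}

# Stand-in for the real backend: the outputs reported in this thread.
reported = {"ظ": "ðaaʔ", "ذ": "ðaal"}

mismatches = find_unexpected_outputs(expected, reported.__getitem__)
print(mismatches)  # {'ظ': 'ðaaʔ'}

# With the real backend from the report, the callable could be something like:
#   lambda letter: g2p.phonemize([letter], strip=True)[0]
```

Only "ظ" is flagged, because its output starts with plain "ð" rather than the pharyngealized "ðˤ". That matches the report.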