CUNY-CL / wikipron

Massively multilingual pronunciation mining
Apache License 2.0
315 stars 71 forks source link

[bul] inconsistencies #142

Open kylebgorman opened 4 years ago

kylebgorman commented 4 years ago

As reported here there are some inconsistencies with /l/ and the dental stops. As [discussed here](https://en.wiktionary.org/wiki/Wiktionary:Information_desk/2020/April#Performing_bulk_edits, there is a pronunciation module and pron template for Bulgarian on Wiktionary; we may be just bulk-migrate Bulgarian to that template and re-scrape.

kylebgorman commented 3 years ago

Just adding to this: the phone list is filtering out ʊ and ŋ. I suspect these are merely allophones of /u/ and /n/, respectively, but we should remove them from the so-called phonemic transcriptions upstream.