Closed tresoldi closed 3 years ago
Pinging @lingulist, @xrotwang, @cormacanderson, @SimonGreenHill, @chrzyki
Decide what to do with the missing tones that include other segmental or suprasegmental features: ˥˧̰, ˥˩˩˥, ˦ˀ, ˦˥̰, ˦̰, ˧˨ˤ, ˧˨̤, ˧˩̤, ˧˩̰, ˨˩̤, ↓˦˨, ↓˦↓˦
Leave it: it is not serious to mark glottalization on the tone, if you can mark it on the vowel, etc., so it is an artifact of a rather problematic sampling.
@tresoldi, not all the aliases need to be added, it is enough to contrast them with the CLTS counterpart, right?
@LinguList it is a matter of deciding whether to add them or not; in this list we have all graphemes used in Phoible (even if only once) that we thought might be added. I agree with you on the tones, for example -- marking features such as nasalization or pharyngealization on the tone segment sounds not only like bad practice, but wrong.
I would already add most of those listed, with the exception of clicks (I'd only add those which are transparent and easy) and the co-articulated ones. In particular, what is your take on the frictionalized vowels?
Sorry, yes, frictionalized vowels would require a new feature. It is in fact not difficult to add the feature, so it could definitely be done. One could call it "frictionalization" and "with_fricture"?
I don't know how many instances there were, but for the sake of completeness, I agree it's a good idea to add these. Probably better to call the new feature for the frictionalised vowels "with_friction".
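To make the proposal concrete, here is a minimal sketch of how the frictionalized diacritic (U+0353, COMBINING X BELOW) could be detected on a grapheme and turned into a feature value. It does not use pyclts; the function names and the `with_friction` value are only the hypothetical naming discussed above, not an existing CLTS feature.

```python
import unicodedata

# U+0353 COMBINING X BELOW, the diacritic Phoible uses for frictionalized vowels.
FRICTION_MARK = "\u0353"


def has_friction(grapheme):
    """Return True if the grapheme carries the frictionalization diacritic."""
    return FRICTION_MARK in unicodedata.normalize("NFD", grapheme)


def split_friction(grapheme):
    """Strip the diacritic and return (base, features).

    "with_friction" is a hypothetical feature value, as suggested in the
    thread; it is not part of CLTS at the time of writing.
    """
    decomposed = unicodedata.normalize("NFD", grapheme)
    base = "".join(ch for ch in decomposed if ch != FRICTION_MARK)
    features = ["with_friction"] if base != decomposed else []
    return base, features
```

With this, `split_friction("i\u0353")` yields the plain vowel plus the extra feature, while graphemes without the diacritic pass through unchanged.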
When starting to prepare the PR, and also working on the inventories, I found more cases. They are co-articulated consonants that are either rejected or wrongly parsed as clusters.
I have quickly discussed them with @cormacanderson already. The changes would be:
- kp (IPA: k͡p), currently interpreted as "from voiceless velar stop to voiceless bilabial stop cluster"
- gb (IPA: ɡ͡b), currently interpreted as "from voiced velar stop to voiced bilabial stop cluster"
- tp (IPA: t͡p), currently interpreted as "from voiceless alveolar stop to voiceless bilabial stop cluster"
- db (IPA: d͡b), currently interpreted as "from voiced alveolar stop to voiced bilabial stop cluster"
- qʡ (IPA: q͡ʡ), currently interpreted as "from voiceless uvular stop to voiceless epiglottal stop cluster"
- nm (IPA: n͡m), currently giving an UnknownSound
- ŋm (IPA: ŋ͡m), currently giving an UnknownSound

The treatment would be in line with the one we currently have for /ɧ/. Note that the code already supports these sounds; it is just parsing them the wrong way and is unable to render them as strings:
>>> str(c.bipa["voiceless labio-velar stop consonant"])
'<?><!>'
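As an illustration of the alias-style treatment, the following sketch maps the plain digraphs listed above to their tie-bar IPA forms. It is a standalone example under stated assumptions: the table and the function `normalize_coarticulated` are hypothetical and are not part of pyclts.

```python
TIE = "\u0361"  # COMBINING DOUBLE INVERTED BREVE (the IPA tie bar)

# Hypothetical alias table: plain digraph -> co-articulated IPA form,
# taken from the cases listed in this thread.
COARTICULATED = {
    "kp": "k" + TIE + "p",
    "gb": "\u0261" + TIE + "b",  # IPA uses U+0261 (ɡ), not ASCII "g"
    "tp": "t" + TIE + "p",
    "db": "d" + TIE + "b",
    "q\u02a1": "q" + TIE + "\u02a1",
    "nm": "n" + TIE + "m",
    "\u014bm": "\u014b" + TIE + "m",
}


def normalize_coarticulated(grapheme):
    """Return the tie-bar form for a known digraph, else the input unchanged."""
    return COARTICULATED.get(grapheme, grapheme)
```

A lookup like this would sit in front of the parser, so that `kp` is resolved to a single co-articulated sound instead of being read as a two-stop cluster.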
I'd say it has now been decided that these Phoible-related issues will be handled by modifying the links to clts, not CLTS itself.
Any remaining issues, e.g., those that would require modifying CLTS (features, new sounds), should be added here.
I close this as we have a new way to tackle problems now.
After discussion with Cormac, these are the issues with Phoible graphemes that should be addressed:
- dlʷ as an alias (voiced lateral affricate)
- d̠ʓ as an alias (ʒʲ)
- d̪l̪ as an alias (voiced lateral affricate)
- d̪ð̪ as an alias (voiced dental affricate)
- d͇z͇ (new sound, alveolar diacritic)
- i͓, u͓, and ɯ͓ as new sounds (frictionalised diacritic U+0353, might require new feature)
- kǀ̪, kǀ͓x, kǀ͓ˀ, kǀ͓ˠʰ, kǁ͓xʰ, kǁ͓ˀ, kǂ͓ˡ, kǂ͓ˡx, kǂ͓ˡʰ, kǃ̪, kǃ͓, k‼, k‼x, k‼xʼ, k‼ʰ, k‼ʰʼ, k‼ʼ, qǀ, qǀʼ, qǁ, qǁʼ, qǂ, qǂʼ, qǃ, qǃʼ, qʘ, qʘʼ, ŋǂ͓ˡ, ŋ̤ǀ, ŋ̤ǀ͓, ŋ̤ǁ, ŋ̤ǂ, ŋ̤ǂ͓ˡ, ŋ̤ɡǃ, ŋ̥ǀ͓xˀ, ŋ̥ǁ͓ʰ, ŋ̥ǂxˀ, ŋ̥ǂʰ, ŋ̥ǂˀ, ŋ̥ǂ͓ˡxˀ, ŋ̥ǂ͓ˡʰ, ŋ̥ǂ͓ˡˀ, ŋ̥ǃˠˀ, ŋ̥ǃ̠ʰ, ŋ‼, ŋ‼ʱ, ɡǀ͓x, ɡǁ͓, ɡǂ͓ˡ, ɡǂ͓ˡx, ɡ̤ǀ, ɡ̤ǀ͓, ɡ̤ǁ, ɡ̤ǂ, ɡ̤ǂ͓ˡ, ɡ̰ǀ͓x, ɡ̰ǂx, ɡ̰ǂ͓ˡx, ɡ̰ǃx, ɡ‼, ɡ‼x, ɡ‼xʼ, ɡ‼ʱ, ɢǀ, ɢǀqʰ, ɢǁ, ɢǁqʰ, ɢǂ, ɢǃ, ɢǃqʰ, ɢʘ, ʔŋǀ, ʔŋǁ, ʔŋǂ, ʔŋǃ, ʔŋʘ, kʟ̥ʼ and kʟ͓̥ʼ (we currently use pre-Unicode SIL extensions, k and kʼ, see here and here)
- mw as an alias (labialized bilabial)
- ndl as an alias (voiced lateral affricate)
- aʲː as an alias (long diphthong)
- ãʲ as an alias (long diphthong)
- aʲ as an alias (diphthong)
- eʲ as an alias (diphthong)
- oʲ as an alias (diphthong)
- õʲ as an alias (diphthong)
- i̩ː as an alias (even though arguably tautological)
- dl as an alias (voiced lateral affricate)
- tl as an alias (voiceless lateral affricate)
- w̜ and w̜ʲ
- ŋmʲ, ŋmʷ, ŋmˤ, ŋɡmb, ŋ̥m̥
- ˥˧̰, ˥˩˩˥, ˦ˀ, ˦˥̰, ˦̰, ˧˨ˤ, ˧˨̤, ˧˩̤, ˧˩̰, ˨˩̤, ↓˦˨, ↓˦↓˦
- ŋm and ŋmɡb (co-articulated nasal stops)

In the same round, the following issues could be addressed as well: