cldf-clts / clts

Cross-Linguistic Transcription Systems
https://clts.clld.org
14 stars 3 forks source link

Fix issues related to Phoible #27

Closed tresoldi closed 3 years ago

tresoldi commented 4 years ago

After discussion with Cormac, these are the issues with Phoible graphemes that should be addressed:

In the same round, the following issues could be addressed as well:

tresoldi commented 4 years ago

Pinging @lingulist, @xrotwang ,@cormacanderson, @SimonGreenHill, @chrzyki

LinguList commented 4 years ago

decide what to do with missing tone that include other segmental or supra-segmental features: ˥˧̰, ˥˩˩˥, ˦ˀ, ˦˥̰, ˦̰, ˧˨ˤ, ˧˨̤, ˧˩̤, ˧˩̰, ˨˩̤, ↓˦˨, ↓˦↓˦

Leave it: it is not serious to mark glottalization on the tone, if you can mark it on the vowel, etc., so it is an artifact of a rather problematic sampling.

LinguList commented 4 years ago

@tresoldi, not all the aliases need to be added, it is enough to contrast them with the CLTS counterpart, right?

tresoldi commented 4 years ago

@LinguList it is a matter of deciding whether to add them or not; in this list we have all graphemes used in Phoible (even if only once) that we though might be added. I agree with you on the tones, for example -- marking features such as nasalization or pharyngealization on the tone segment sound not only a bad practice, but wrong.

I would already add most of those listed, with the exception of clicks (I'd only add those which are transparent and easy) and the co-articulated ones. In particular, what is your take on the frictionalized vowels?

LinguList commented 4 years ago

Sorry, yes, frictionalized vowels would require a new feature. It is in fact not difficult to add the feature, so it could definitely be done. One could call it "frictionalization" and "with_fricture"?

cormacanderson commented 4 years ago

I don't know how many instances there were, but for the sake of completeness, I agree it's a good idea to add these. Probably better to call the new feature for the frictionalised vowels "with_friction".

tresoldi commented 3 years ago

When starting to prepare the PR, and also working on the inventories, I found more cases. They are co-articulated consonants that are either rejected or wrongly parsed as clusters.

I have quickly discussed them with @cormacanderson already. The changes would be:

Treatment would be in line with the one we currently have for /ɧ/. Note that the code already support these sounds, it is just parsing them in the wrong way and unable to render them as string:

>>> str(c.bipa["voiceless labio-velar stop consonant"])
'<?><!>'
cormacanderson commented 3 years ago
LinguList commented 3 years ago

I'd say, these phoible-related issues have now been decided to be handled by modifying links to clts, not CLTS itself.

LinguList commented 3 years ago

any remaining issues, e.g., that would require to modify CLTS (features, new sounds), should be added here.

LinguList commented 3 years ago

I close this as we have a new way to tackle problems now.