clics / pyclics

python package implementing the CLICS processing workflow
Apache License 2.0
3 stars 0 forks source link

Handling "or" concepts in clics #2

Open LinguList opened 5 years ago

LinguList commented 5 years ago

ARM or HAND is a colexificaton itself, so if our dataset contains it, we won't capture the colexification, since it is silently annotated in the original data. For an arm/hand survey, however, we'd like to split those. Can / should we try and do this upon import from cldf? The procedure would be: search for " or " glosses and split them acc. to their narrower descendants. Caveat: not all relations are amenable to that, maybe we'll need a hand-coded list (but that's perfectly doable!).

LinguList commented 5 years ago

Having seen #6, I was reminded of this issue. How do we address it: do we ignore it for the time being (which is in fact fine), or do we try to address it? If the latter, we'd need a hand-coded list, as we do not know the effect on kinship terms: if we have brother, and big brother, etc., it might well screw the hierarchy if we go all the ladder down here? Or maybe this is only consequent? In any way, this is something we need to address in some way, be it by explicitly ignoring it for now, or by making some decisions. I'd gladly check the concepts to select the candidates if manual disseparation is needed. Even a quick check on how many concepts with broader-relations we have, may be illuminating here...

xrotwang commented 5 years ago

Ignored at least for release 2.0.