clics / clicsbp

CLDF dataset on Body Part Colexifications
Creative Commons Attribution 4.0 International

Reduce body part concepts #29

Closed AnnikaTjuka closed 2 years ago

AnnikaTjuka commented 2 years ago

This PR deletes body concepts that do not have sufficient coverage across language families.

AnnikaTjuka commented 2 years ago

@LinguList As discussed, I deleted body concepts that occurred in only a few languages. We are now at 80 body concepts.

LinguList commented 2 years ago

One question: have you by any chance looked into matches with WordNet? I am asking because we'd like to look at the hierarchy underlying these body parts, so good coverage would be quite useful.

AnnikaTjuka commented 2 years ago

I haven't. But the concepts seem fairly "standard". I will double-check whether they occur in WordNet.

AnnikaTjuka commented 2 years ago

@LinguList I checked with Bond et al. (2013), i.e., Multilingual WordNet, which we have in NoRaRe, and 57 of the 80 body part concepts are covered.

LinguList commented 2 years ago

Okay, then we can have a look at the network later.