concepticon / norare-data

Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts

STEDT taxonomy #88

Closed AnnikaTjuka closed 4 years ago

AnnikaTjuka commented 4 years ago

Originally posted here: https://github.com/concepticon/concepticon-data/issues/36

Data: https://github.com/stedt-project/sss/tree/master/semcats/revision1
Website: http://stedt.berkeley.edu/~stedt-cgi/rootcanal.pl/chapters#2.0

AnnikaTjuka commented 4 years ago

I combed through the GitHub repository and the database and found one list that would be feasible to add to NoRaRe: lexicon_semcats-glosses_20110913.xlsx

It lists the glosses for each category. Unfortunately, it does not include the category labels, which would make the categories more explicit. I also couldn't find out what rn stands for:

| rn | gloss | semcat | CHAP | CAT |
|---|---|---|---|---|
| 191741 | body | 1a/1.1 | 1a | 1.1 |
| 39240 | lower body | 1a/1.1 | 1a | 1.1 |
| 1287 | naked | 1a/1.1 | 1a | 1.1 |
| 21200 | nudity | 1a/1.1 | 1a | 1.1 |
| 43127 | physique | 1a/1.1 | 1a | 1.1 |
| 121240 | carrion | 1a/1.1.1 | 1a | 1.1.1 |
| 1030 | flesh | 1a/1.1.1 | 1a | 1.1.1 |
| 2355 | flesh-meat | 1a/1.1.1 | 1a | 1.1.1 |
| 148764 | livestock | 1a/1.1.1 | 1a | 1.1.1 |
| 491 | meat | 1a/1.1.1 | 1a | 1.1.1 |

@LinguList Should I add this list or prioritize another one from the repository?

LinguList commented 4 years ago

Yes, this is a good idea. RN stands for the record number in the STEDT database; it serves as the identifier for a given word. You can search them here.

LinguList commented 4 years ago

The URL which this resolves to is https://stedt.berkeley.edu/~stedt-cgi/rootcanal.pl/edit/lexicon?lexicon.rn=

LinguList commented 4 years ago

So I would rename the rn column to "STEDT_ID"; in the metadata, you can code it the same way we do for BabelNet:

      {
        "name": "BABELNET_ID",
        "datatype": "string",
        "propertyUrl": "dc:relation",
        "valueUrl": "http://babelnet.org/synset?word=bn:{BABELNET_ID}"
      }, 

so you write:

      {
        "name": "STEDT_ID",
        "datatype": "string",
        "propertyUrl": "dc:relation",
        "valueUrl": "https://stedt.berkeley.edu/~stedt-cgi/rootcanal.pl/edit/lexicon?lexicon.rn={STEDT_ID}"
      }, 
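
To illustrate how the column description above is meant to work, here is a minimal Python sketch that renames rn to STEDT_ID and fills the valueUrl placeholder for each row. The tab-separated sample (three rows copied from the excerpt above) and the helper names `read_rows` and `expand_value_url` are assumptions made for illustration; they are not part of the NoRaRe tooling, and the substitution is a plain string replacement rather than a full URI-template implementation.

```python
import csv
import io

# Column description as proposed in this thread.
STEDT_COLUMN = {
    "name": "STEDT_ID",
    "datatype": "string",
    "propertyUrl": "dc:relation",
    "valueUrl": "https://stedt.berkeley.edu/~stedt-cgi/rootcanal.pl/edit/lexicon?lexicon.rn={STEDT_ID}",
}

# Three rows from the spreadsheet excerpt; a tab-separated export is assumed.
SAMPLE = (
    "rn\tgloss\tsemcat\tCHAP\tCAT\n"
    "191741\tbody\t1a/1.1\t1a\t1.1\n"
    "39240\tlower body\t1a/1.1\t1a\t1.1\n"
    "491\tmeat\t1a/1.1.1\t1a\t1.1.1\n"
)


def read_rows(text):
    """Read tab-separated rows and rename the rn column to STEDT_ID (hypothetical helper)."""
    rows = []
    for row in csv.DictReader(io.StringIO(text), delimiter="\t"):
        row["STEDT_ID"] = row.pop("rn")
        rows.append(row)
    return rows


def expand_value_url(column, row):
    """Replace the {NAME} placeholder in the column's valueUrl with the row's value."""
    name = column["name"]
    return column["valueUrl"].replace("{" + name + "}", row[name])


rows = read_rows(SAMPLE)
for row in rows:
    print(row["gloss"], expand_value_url(STEDT_COLUMN, row))
```

Keeping the identifier as a string, as the "datatype" entry suggests, avoids any numeric reformatting of the record numbers when the list is read or written.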

AnnikaTjuka commented 4 years ago

Great, thanks! I'll prepare the list accordingly.