autotyp / autotyp-data

AUTOTYP data export
Creative Commons Attribution 4.0 International
38 stars 20 forks source link

Locus #43

Open balthasarbickel opened 2 years ago

balthasarbickel commented 2 years ago

In LocusOfMarkingPerLanguage.csv there are 12 unnamed items in LocusOfMarkingBinned5ForPOSSDefault.

tzakharko commented 2 years ago

I assume you are referring to the NAs in that column? That's because some languages don't have a POSS exemplar, e.g.

filter(GrammaticalMarkers, LID == 116) %>% count(MarkerExemplar)
# A tibble: 3 × 2
  MarkerExemplar     n
  <fct>          <int>
1 Case               1
2 Negation           1
3 NA                 1

filter(GrammaticalMarkers, LID == 114) %>% count(MarkerExemplar)
# A tibble: 5 × 2
  MarkerExemplar     n
  <fct>          <int>
1 Case               1
2 Negation           1
3 Noun Plural        1
4 Tense              1
5 NA                 9

Full list of languages lacking POSS exemplar (but having other data in GrammaMarkers):

LID Glottocode Language
114 ingu1240 Ingush
116 itel1242 Itelmen
169 mait1250 Maithili
192 movi1243 Movima
337 kich1262 Quiche
340 russ1263 Russian
519 brib1243 Bribri
629 chan1310 Chantyal
662 apat1240 Apatani
663 drun1238 Drung
722 nort2722 Qiang (Northern)
1706 kadi1248 Kadiwéu

This will need to be fixed at the database data entry level.