autotyp / autotyp-data

AUTOTYP data export
Creative Commons Attribution 4.0 International
38 stars 17 forks source link

Gender dataset problems #12

Open agricolamz opened 6 years ago

agricolamz commented 6 years ago

I looked through the Gender dataset and found that there are no values in the Gender.Presence column, but at the same time there are some values in Gender.n and Gender.bined4 columns:

I also noticed that according these dataset there are no gender in Abkhaz. These looks strange for me.

I'd propose to change Gender.n value to 0 in cases when there is a FALSE value in the Gender.Presence column.

It is also interesting to know, why there are some languages that have TRUE value in the Gender.Presence column, but no value in other columns:

And the last point: why there is no values in any column for Tagalog, LID 362?

I compared this dataset with the WALS 30A feature. From 125 common languages 10 have different values. Please have a look.