bmkramer / 101innovations-survey-data

Stringing beads - identifying research workflows from tool usage data (clustering)
11 stars 5 forks source link

Statistic pitfalls: binary data #19

Open jcolomb opened 6 years ago

jcolomb commented 6 years ago

The data seem to be mainly binary or of categorical nature. Most statistics for high dimensionality data only deal with continuous values. I got into similar problems when trying to analyse some data from a rdm survey. I have not looked much into the question, but it definitively needs specific stats, does anyone knows more about this?

jcolomb commented 6 years ago

ps: link to rdm survey: https://osf.io/54em6/