gvdr / DATA471

Open Repository of Knowledge for DATA471
GNU General Public License v3.0
7 stars 2 forks source link

Data Carpentry #1

Open wrightaprilm opened 6 years ago

wrightaprilm commented 6 years ago

Is the idea of this segment to discuss documenting the workflow, for how the data were used? I don't know some of the terms that are on the syllabus, but that's what I'm picking up as a core component - how can a stakeholder verify that what has been done with the data is what was agreed to? With facebook, there's so much obfuscating language, third-party agreements that this is nearly impossible, but for smaller scale applications, that may not be the case.

gvdr commented 6 years ago

Sorry for the late answer. And THANKS for the feedback!

The "Carpentry" stuff comes from Ian Bogost's idea that "philosophy" is building stuff. I would like my students to discover a bit of "Object Oriented" (ontology/feminism). The core take home message is that everything, also models, algorithms, data, are "objects", and we never fully understand them. We can explore and investigate them by inter-acting with them, but our knowledge is always limited.

There are a couple of practical consequences: 1) do not believe that your model fully capture something or somebody (i.e., do not reduce a person to their representation in your model) 2) plot your data and play with your models in exotic ways if you want to understand more what they are doing: think about the Anscombe's quartet or The Datasaurus Dozen, you can interpret those either as "strange data" or as a limitation of the diagnostics we use for linear models.

wrightaprilm commented 6 years ago

Oh, that's interesting. Do you think you'll discuss here the importance of teams that don't all come from one perspective and background, or will that be elsewhere?

gvdr commented 6 years ago

Sure! The idea is to discuss it in the "Diversity & Inclusiveness" lecture. At the moment it comes before the Object Oriented lecture. Maybe it would be better after?

In terms of hands-on, we will use material inspired by the Heather Krause's lectures on Feminist Data Analysis: https://app.ruzuku.com/courses/25230/about