ml5js / Intro-ML-Arts-IMA-F19

Syllabus for Introduction to Machine Learning for the Arts at IMA / Tisch / NYU.
MIT License
165 stars 20 forks source link

Guide to Ethical Data Collections Practices #59

Open shiffman opened 4 years ago

shiffman commented 4 years ago

The question came up today in class: "What if I want to collect data? Is there a helpful guide / document of tips / common strategies for ethical data collection?". Please add your suggestions here:

Also, nothing these two topics I referenced:

Duke University MTMC

Atlanta Asks Google Whether It Targeted Black Homeless People

ellennickles commented 4 years ago

The Datasheets for Datasets paper (mentioned in #10) advises that dataset creators answer ~60 questions (!) regarding motivation, curation (composition, collection, and data cleaning), and integration (uses, distribution, and maintenance).

Would it makes sense to focus on a select number of these, especially those questions related specifically to data from other people? I copied these verbatim, but we can edit further…

(Motivation)

(Composition)

(Collection Process)

(Uses)

(Maintenance)

The full list is here. Of note, this paper is frequently referenced in the Partnership on AI’s About ML project which ultimately aims to establish documentation standards across industries for the transparency of entire ML systems—both datasets and models.

shiffman commented 4 years ago

Thank you so much @ellennickles, this is fantastic. (And thank you for summarizing, super helpful.) I plan on discussing this in class tomorrow!