evamaxfield / lcd

Long tail of Controversial Datasets

General notes #3

Open evamaxfield opened 2 years ago

evamaxfield commented 2 years ago

We can try to separate a few datasets that are used strictly for education from those that may be used in prototypes or in production (Iris and Boston Housing are typically educational; MegaFace is typically used in production).

Use Student / Academic account labels for GitHub accounts as the scraping basis for detecting dataset use.

Add GitHub username to the demographic info as a way to get GitHub info for scraping.
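A minimal sketch of how the educational / production split could be applied to scraped repository text. The category table, the function name `find_dataset_mentions`, and the name-matching rule are all assumptions for illustration; only the three example datasets and their labels come from the note above, and no GitHub API calls are made here.

```python
import re

# Hypothetical lookup table; the labels follow the examples in the note above.
DATASET_CATEGORIES = {
    "iris": "educational",
    "boston housing": "educational",
    "megaface": "production",
}


def find_dataset_mentions(text):
    """Return {dataset_name: category} for each known dataset named in text."""
    lowered = text.lower()
    found = {}
    for name, category in DATASET_CATEGORIES.items():
        # Let the words of a name be joined by spaces, underscores, or hyphens
        # (e.g. "boston_housing"), and require that the match is not embedded
        # in a longer run of letters (so "load_iris" matches but "pirish" does not).
        body = r"[\s_-]*".join(re.escape(word) for word in name.split())
        pattern = r"(?<![a-z])" + body + r"(?![a-z])"
        if re.search(pattern, lowered):
            found[name] = category
    return found
```

Calling `find_dataset_mentions` on the contents of a README or notebook would give a rough per-file label; anything beyond simple name matching (for example, telling coursework apart from a prototype) would still need manual review.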

Describe in three paragraphs:

evamaxfield commented 2 years ago

For DSSG

What do we imagine the students would be doing? What data would be used, and what are the steps for getting / processing the data? Who will be the lead for doing this through the summer?

evamaxfield commented 2 years ago

Roadmap:

evamaxfield commented 2 years ago

understand and know research design for surveys

doing a lit review of "who else has studied controversial datasets, what did they find?"

doing the conceptual work ourselves -- where are the gaps -- here is how we are going to validate or test

https://www.zotero.org/groups/4508576/bits/collections/VRK3HG8Z

evamaxfield commented 2 years ago

Download the Zotero beta build and take notes on the readings in the app

evamaxfield commented 2 years ago

General notes on literature reviews: http://www.raulpacheco.org/resources/literature-reviews/

"Use the literature review as a method for argument for why our research should be done" -- "what is the coverage and what is the gap" -- "no one has asked these questions"


Another approach is finding literature that is similar to what you have done and comparing the results. This is common in CS because you may be comparing algorithm performance.


"Practice for the second chapter of your dissertation"


prospective lit review - going out, seeing what exists, how it matches some criteria for relevance (internal)

systematic lit review - externalized, specific query terms


For this project: we can use a literature review as an argument for the study, showing that there is a gap in the literature. But it will be hard to find comparable studies -- this is more a matter of identifying an emerging problem.


potential parameters for vignette:

an additive question about who has access: "you said this was unethical usage, what if users agreed to this use"