Open-NC / Are-We-Represented

Main repository for the Are We Represented? project
4 stars 1 forks source link

Design overall data schema #3

Open ejaxon opened 6 years ago

ejaxon commented 6 years ago

The schema should certainly be guided by data source schemas, but the goal here is to think long-term about how we want to represent this data for maximum flexibility and usability. This will guide the preparation of each individual dataset from its source form.

ChrisTheDBA commented 6 years ago

I'd recommend looking into Adam Damico's census/survey work - https://github.com/ajdamico. He favors a schema-at-read method

LauraMB12 commented 6 years ago

ryan thornburg said he might can refer some students to help- for their data visualization portfolios.

LauraMB12 commented 6 years ago

ryan shared this article / visualizations on residents v. police https://www.nytimes.com/interactive/2014/09/03/us/the-race-gap-in-americas-police-departments.html

ejaxon commented 6 years ago

I think we need to look at this as a more traditional schema since our next phase will be to build a web app that reads it. To my mind, we basically need the following tables (based largely on the fact that the data sources are quite different):

I've created a schema file to think through columns.