hackoregon / 2019HackORDataScienceTemplate

Template to get the 2019 data science parts of a Hack Oregon project started :)
MIT License

explore housing tax lot data processing options with Stephen #18

Open karenng-civicsoftware opened 5 years ago

karenng-civicsoftware commented 5 years ago

discussion to be had

@stephenosserman has been exploring and wrangling our taxlot data in our housing-staging postgis db and has been making some progress.

So far I've appended the census block to each lot for each year (1997-2017), standardized the taxlot-id format, and added an index here and there to make querying faster. I've also written the queries to identify nearly all changes from year to year, including lot divisions, mergers of multiple lots, changes in lot-id without lot geography changes, and (most) other boundary changes. I'm hoping all of this will make longitudinal analysis of taxlot data a lot easier and more accurate.

The brainstorm prompt: What analyses do you think we and/or future folks might want to run using this dataset?

Context is that there are a bunch of possible table structures I could imagine using to store the complex taxlot changes dataset I'm pulling together. I'd like actual analyses that people might run in the future to inform which table structure(s) I go with. Specifically, I'm hoping to work backwards from potential analyses to pseudo-sql to see which approaches might be the best combination of comprehensive, streamlined, and flexible.

We have a few concrete uses for existing housing-team work this year which I'm thinking about, and I'll seed a thread with a few other ideas, but would love more ideas. Thx in advance!
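To make the "work backwards from analyses to pseudo-sql" idea concrete, here is a minimal sketch of one possible table structure: a year-to-year lineage table with one row per predecessor/successor pair. Everything in it is an assumption for illustration only, not the actual housing-staging schema: the table name `taxlot_lineage`, its columns, the `change_type` labels, and the lot ids are all made up. It uses an in-memory SQLite database so it runs on its own; the recursive CTE syntax is also available in PostgreSQL.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory db with a hypothetical lineage table and toy rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE taxlot_lineage (
            year_from   INTEGER NOT NULL,
            lot_id_from TEXT    NOT NULL,
            year_to     INTEGER NOT NULL,
            lot_id_to   TEXT    NOT NULL,
            change_type TEXT    NOT NULL  -- hypothetical labels, see rows below
        )
        """
    )
    rows = [
        # Lot A is divided into A1 and A2.
        (2000, "A", 2001, "A1", "division"),
        (2000, "A", 2001, "A2", "division"),
        # Lots B and C are merged into D.
        (2000, "B", 2001, "D", "merger"),
        (2000, "C", 2001, "D", "merger"),
        # Lot E gets a new id without a geography change.
        (2000, "E", 2001, "F", "id_change"),
        # Second year of history for the pieces of A.
        (2001, "A1", 2002, "A1", "unchanged"),
        (2001, "A2", 2002, "G", "id_change"),
    ]
    conn.executemany("INSERT INTO taxlot_lineage VALUES (?, ?, ?, ?, ?)", rows)
    return conn


def descendants(conn, lot_id, year, target_year):
    """Return the ids in target_year of all lots descended from (lot_id, year)."""
    sql = """
        WITH RECURSIVE chain(year, lot_id) AS (
            SELECT ?, ?
            UNION
            SELECT l.year_to, l.lot_id_to
            FROM taxlot_lineage AS l
            JOIN chain AS c
              ON l.year_from = c.year AND l.lot_id_from = c.lot_id
        )
        SELECT lot_id FROM chain WHERE year = ? ORDER BY lot_id
    """
    return [row[0] for row in conn.execute(sql, (year, lot_id, target_year))]


if __name__ == "__main__":
    conn = build_demo_db()
    print(descendants(conn, "A", 2000, 2002))
    print(descendants(conn, "B", 2000, 2001))
```

With this shape, "what did lot A from 2000 become by 2002?" is a single recursive query, and divisions, mergers, and id changes are all just rows. The trade-off is that boundary changes that are not clean one-to-many or many-to-one mappings would need an extra column (for example an overlap share) or a separate table, which is the kind of thing the analyses suggested in this thread could help decide.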

some possible types of analyses to kick off brainstorming: