NYCPlanning / data-engineering

Primary repository for NYC DCP's Data Engineering team
14 stars 0 forks source link

MapPluto subset as a DB #302

Closed TangoYankee closed 6 months ago

TangoYankee commented 8 months ago

Description

Organize a subset of MapPluto into a normalized database with the structure shown below.

Acceptance Criteria

For at least this iteration, Application Engineering is responsible for the creation and maintenance of the database. The role of data engineering will be to generate csvs which mirror the table structures that AE can then COPY into the database. The borough and land use csvs are small enough that it was feasible to create them by hand. So, DE will only need to generate the remaining csvs for tax lots and zoning districts.

Note: Zoning districts are intentionally excluded from the tax lot table. They will be related to tax lots through spatial joins performed when an end user requests zoning information about a tax lot

zoning-erd drawio

Data sources

The majority of data comes from Map Pluto. However, geospatial data on zoning districts will need to be sourced from the zoning districts data set.

Data descriptions

The data fields are sourced and structured as follows

TangoYankee commented 8 months ago

pinging @TylerMatteo for confirmation of details

TangoYankee commented 8 months ago

Will make zoning district PK a UUID

TylerMatteo commented 8 months ago

@TangoYankee Looks good!

Some feedback:

TangoYankee commented 8 months ago

@TylerMatteo Updated the ticket with zoning district id, zoning district class category, and borough title changes

@damonmcc A note on drawio svg v png; the first iteration of the ticket used svg. But, GitHub's rendering was ranging from awkward to unreadable. So, I swapped to png.

TangoYankee commented 7 months ago

@TylerMatteo If we want to account for colors of zoning districts and land use in the database, we could:

1) Break the zoning district category enum into a full table. Then, make a 'color' field on the created zoning_district_category table 2) Add a color field to the land use table

TylerMatteo commented 7 months ago

Based on our IRL discussions:

TangoYankee commented 7 months ago

@NYCPlanning/data-engineering updating zoning_district_class to have a description field

TylerMatteo commented 7 months ago

TODO for @TangoYankee - add a field to the zoning district class ERD for the URL to link to the public website page for that class. We're going to make it nullable to account for cases where we can't build a real link.

TangoYankee commented 7 months ago

add a field to the zoning district class ERD for the URL to link to the public website page for that class. We're going to make it nullable to account for cases where we can't build a real link.

@NYCPlanning/data-engineering I recognize the description I gave for the url field is kinda lackluster but I'm not really sure how to explain it further. I am available for further clarification.