NYCPlanning / data-engineering

Primary repository for NYC DCP's Data Engineering team

CEQR Survey outputs #590

Closed damonmcc closed 7 months ago

damonmcc commented 7 months ago

The survey GIS is making must display the flags for all BBLs in PLUTO and the spatial source data that was used in the flag logic.

Goals

notes

fvankrieken commented 7 months ago

Hi @jackrosacker, just wanted to start getting down some specifics around outputs for this and make sure we're on the same page as you. @AmandaDoyle @croswell81 for visibility

We sort of have two broad issues

Output Tables

Breaking these into two categories

Source data/buffers

En route to producing final outputs, we have intermediate tables with buffered geoms. The tables look something like this (imagine those geoms are in 2263):

| variable | id | buffered_geom |
| --- | --- | --- |
| elevated_railway | 14 STREET/CANARSIE LINE | POLYGON ((-73.91110564962213 40.685452360859514, -73.91110564954899 40.68545236162799) |
| elevated_railway | 6 AVENUE LINE | POLYGON ((-73.91110564962213 40.685452360859514, -73.91110564954899 40.68545236162799) |


Here, variable is the name of the thing a flag can be raised for, and id is a unique identifier for a feature within its variable type. There are some outstanding questions on how these should be exported.

Computed outputs (actual outputs)

We were planning on having our "long" table of flags included, let us know if you'd prefer us to leave it out. In this table, each row is a "flag". Any bbls with no flags will not be found in this table.

| bbl | variable | variable_id |
| --- | --- | --- |
| 1000010001 | elevated_railway | 14 STREET/CANARSIE LINE |
| 1000010001 | edesignation_noise | 1 |
| 1000010002 | elevated_railway | 14 STREET/CANARSIE LINE |

This could also include a "distance" column giving the distance from the noted variable. It would only apply to geospatial flags; e-designations, for example, have no distance.
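To make the long format concrete, here is a minimal sketch of how such rows might be assembled, with a distance only for geospatial flags. The `Flag` type, the `build_long_flags` helper, and the distance values are all hypothetical and for illustration only; the real pipeline may build this table quite differently.

```python
from typing import Iterable, List, NamedTuple, Optional, Tuple


class Flag(NamedTuple):
    bbl: str
    variable: str
    variable_id: str
    distance: Optional[float]  # None for non-spatial flags such as e-designations


def build_long_flags(
    spatial_hits: Iterable[Tuple[str, str, str, float]],
    nonspatial_hits: Iterable[Tuple[str, str, str]],
) -> List[Flag]:
    """Combine spatial and non-spatial hits into one long table of flags.

    A BBL that appears in neither input simply has no rows in the result.
    """
    rows = [Flag(b, v, i, float(d)) for b, v, i, d in spatial_hits]
    rows += [Flag(b, v, i, None) for b, v, i in nonspatial_hits]
    # sorted() is stable, so rows for the same BBL keep their input order.
    return sorted(rows, key=lambda r: r.bbl)


# Hypothetical inputs: the distances below are made up for the example.
spatial = [
    ("1000010001", "elevated_railway", "14 STREET/CANARSIE LINE", 0.0),
    ("1000010002", "elevated_railway", "14 STREET/CANARSIE LINE", 75.5),
]
nonspatial = [("1000010001", "edesignation_noise", "1")]

long_flags = build_long_flags(spatial, nonspatial)
for flag in long_flags:
    print(flag)
```

Using `None` rather than 0 for non-spatial flags keeps "no distance applies" distinct from "the lot touches the feature".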

We are also going to include some form of wide format:

| bbl | elevated_railway | edesignation_noise | ... |
| --- | --- | --- | --- |
| 1000010001 | 14 STREET/CANARSIE LINE | 1 | ... |
| 1000010002 | 14 STREET/CANARSIE LINE | NULL | ... |
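A wide table like this could be derived from long-format rows of (bbl, variable, variable_id). The sketch below is one possible approach, not the team's implementation: `long_to_wide` and its `all_bbls` parameter are hypothetical names, and joining multiple ids for the same variable with "; " is an assumption, since the thread does not say how that case should be handled.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Optional, Sequence, Tuple


def long_to_wide(
    long_rows: Iterable[Tuple[str, str, str]],
    variables: Sequence[str],
    all_bbls: Optional[Iterable[str]] = None,
) -> List[Dict[str, Optional[str]]]:
    """Pivot (bbl, variable, variable_id) rows into one row per BBL.

    If all_bbls is given (e.g. every BBL in PLUTO), BBLs with no flags
    still get a row, with None in every variable column.
    """
    by_bbl = defaultdict(lambda: defaultdict(list))
    for bbl, variable, variable_id in long_rows:
        by_bbl[bbl][variable].append(str(variable_id))

    bbls = set(by_bbl)
    if all_bbls is not None:
        bbls |= set(all_bbls)

    wide = []
    for bbl in sorted(bbls):
        flags = by_bbl.get(bbl, {})
        row: Dict[str, Optional[str]] = {"bbl": bbl}
        for variable in variables:
            ids = flags.get(variable)
            # Assumption: several ids for one variable are joined with "; ".
            row[variable] = "; ".join(ids) if ids else None
        wide.append(row)
    return wide


long_rows = [
    ("1000010001", "elevated_railway", "14 STREET/CANARSIE LINE"),
    ("1000010001", "edesignation_noise", "1"),
    ("1000010002", "elevated_railway", "14 STREET/CANARSIE LINE"),
]
wide = long_to_wide(long_rows, ["elevated_railway", "edesignation_noise"])
for row in wide:
    print(row)
```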

There was also a desire for some explicitly binary output, using 1 or 0 for easy calculation within [name of esri product].
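As a rough illustration of the binary variant, the sketch below converts wide rows to 1/0 columns, treating any non-null value as a flag. The `to_binary` helper and the sample rows are hypothetical; the actual column set and null handling are still open questions in this thread.

```python
from typing import Any, Dict, List


def to_binary(wide_rows: List[Dict[str, Any]], key: str = "bbl") -> List[Dict[str, Any]]:
    """Replace every non-key column with 1 if it has a value, else 0."""
    return [
        {col: (val if col == key else int(val is not None)) for col, val in row.items()}
        for row in wide_rows
    ]


# Sample wide rows mirroring the example values used in this thread.
wide_rows_example = [
    {"bbl": "1000010001", "elevated_railway": "14 STREET/CANARSIE LINE", "edesignation_noise": "1"},
    {"bbl": "1000010002", "elevated_railway": "14 STREET/CANARSIE LINE", "edesignation_noise": None},
]

binary_rows = to_binary(wide_rows_example)
for row in binary_rows:
    print(row)
```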

A few questions to be resolved

Other tables

Is there anything missing in above summary? Say, a lookup table of variable, data source it comes from, question it relates to, etc.

Packaging

@damonmcc was under the impression that one gdb might be the most convenient, is that true?

damonmcc commented 7 months ago

GIS thoughts

general

source data in output