NYCPlanning / data-engineering

Primary repository for NYC DCP's Data Engineering team

FacDB: pick recipe #690

Closed by sf-dcp 1 month ago

sf-dcp commented 6 months ago

While getting Shadow category recipes for the Green Fast Track product, I realized that one of the needed datasets, NYS Parks Property, may already be present in the data library (nysparks_parks). I still ended up pulling the requested data from a different source, chosen by GIS, into a new recipe, nysparks_parks_polygons, because the two datasets have different geometries and different row counts.
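For anyone re-checking the difference, here is a minimal sketch of the comparison described above (row counts and geometry types), assuming both datasets have been exported as GeoJSON FeatureCollections. The function names and the sample features are hypothetical and made up for illustration; they are not the real contents of nysparks_parks or nysparks_parks_polygons.

```python
from collections import Counter


def summarize(feature_collection):
    """Return the row count and a tally of geometry types for a GeoJSON FeatureCollection."""
    features = feature_collection.get("features", [])
    geometry_types = Counter(
        # Features with a null geometry are tallied under the label "None".
        (feature.get("geometry") or {}).get("type", "None")
        for feature in features
    )
    return {"rows": len(features), "geometry_types": dict(geometry_types)}


def compare(left, right):
    """Summarize both datasets and flag whether row counts or geometry types differ."""
    left_summary, right_summary = summarize(left), summarize(right)
    return {
        "left": left_summary,
        "right": right_summary,
        "same_row_count": left_summary["rows"] == right_summary["rows"],
        "same_geometry_types": set(left_summary["geometry_types"])
        == set(right_summary["geometry_types"]),
    }


def _feature(geometry_type):
    # Hypothetical helper: builds a bare feature with no coordinates.
    return {"type": "Feature", "geometry": {"type": geometry_type}, "properties": {}}


# Made-up samples standing in for two recipes; not real data.
sample_a = {"type": "FeatureCollection", "features": [_feature("Point"), _feature("Point")]}
sample_b = {
    "type": "FeatureCollection",
    "features": [_feature("Polygon"), _feature("Polygon"), _feature("MultiPolygon")],
}

if __name__ == "__main__":
    print(compare(sample_a, sample_b))
```

Running the same `compare` on real exports of the two recipes would show at a glance whether they diverge in size or geometry type before deciding which one FacDB should use.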

Motivation

FacDB uses the nysparks_parks recipe. We should re-evaluate whether to keep using it or switch to nysparks_parks_polygons instead.

Info on the recipes:

Timeline

Before the next FacDB build (May 2024?)

alexrichey commented 5 months ago

Same situation for nysparks_historicplaces. It is a Socrata datasource, but the ESRI version has been updated more recently and has more rows. We use the newer version, nysparks_historicplaces_esri, in GFT, and need to swap it into FacDB as well.

damonmcc commented 2 months ago

gonna make the change to use nysparks_parks_polygons and nysparks_historicplaces_esri as part of the 24v2 build (data update issue)