NYCPlanning / data-engineering

Primary repository for NYC DCP's Data Engineering team
25 stars 1 forks source link

DE<>GIS Automations - Phase 1 #281

Closed damonmcc closed 8 months ago

damonmcc commented 1 year ago

This phase covers scoping, exploration, and trying discussed approaches on a couple of datasets (likely to be MIH and COLP).

Glossary

packaging - data and metadata transformations that happen prior to push to other data stores (e.g. Socrata) flow - a movement of data. It could be a push or a pull. metadata -

Tasks

damonmcc commented 1 year ago

OTI PDF which details Open Data self-publishing option: link

alexrichey commented 1 year ago

Also shared, DCAT standards

alexrichey commented 1 year ago

Takeways from meeting with GIS on Oct 13.

alexrichey commented 1 year ago

GDE-Dataflow

Notes:

The scope for this initial phase is to draw a line straight from AGOL to OpenData. Currently OTI receives OpenData updates via email, and OTI would like very much to have this process automated.

alexrichey commented 1 year ago

Options for pushing to OpenData

Misc

alexrichey commented 1 year ago

Agenda:

Progress So Far (Ie, last sprint's work)

Next Steps

What I mean by "Fan Out" Fanout drawio

alexrichey commented 1 year ago

Notes from DE <> GIS meeting (10/23/2023)

Question: Can we implement the fanout diagrammed above?

Next Steps:

Notes / Questions:

croswell81 commented 1 year ago

Adding two diagrams of GIS open data process: Original: image

More recent: when @jackrosacker returns he can add the other diagram.

croswell81 commented 1 year ago

@alexrichey FYI - Alex from OTI open data team shared the following resources:

caseysmithpgh commented 1 year ago

@alexrichey Here's the updated diagram. I did my best to include bits from the whiteboard yesterday, and synthesize some of our thoughts on "phases". Looking forward to hearing your thoughts once we're both back in the office.

casey_proposal_updates

alexrichey commented 1 year ago

Next Steps [As of Nov 20]:

OpenData

alexrichey commented 1 year ago

Next Steps:

Questions:

croswell81 commented 12 months ago

@alexrichey to answer the question, Is there a comprehensive list of DCP datasets on Socrata? - unfortunately its not that straightforward since there is not a one-to-one match of all data on BYTES of the BIG APPLE (BOBA) and NYC Open Data portal.

  1. This is OTI's list they generate as all datasets on the NYC Open Data portal, which can be filtered by agency
  1. Recommend you use this list, which is our list we use to track open data updates and send to OTI. We just added open data UID and URL as fields in this table.
alexrichey commented 11 months ago

Next Steps

Checklist for datasets

Process for GIS to onboard datasets to push to DO

Get Other DE Team members involved

Planning Metadata for Products