Closed damonmcc closed 8 months ago
Also shared, DCAT standards
Takeways from meeting with GIS on Oct 13.
Notes:
The scope for this initial phase is to draw a line straight from AGOL to OpenData. Currently OTI receives OpenData updates via email, and OTI would like very much to have this process automated.
Options for pushing to OpenData
Socrata Gateway Agent (a running process on our servers) that will configure itself by phoning home (to Socrata) and push data on a schedule. It exposes data syncing functionality, but not metadata. Conceivably you could run this on a machine behind our firewall to sync data to Socrata. Probably not worth due to the limitations of syncing just data.
Socrata-py
Fully featured python client. Docs say it hits the Socrata Data Management API
but it hits the publishing REST API as well. Our best option. I've tried pushing datasets, metadata, etc. and it's straightforward.
Publishing API Umbrella for a few different APIs (SODA Producer API, Dataset Management API, Import API)
Misc
Agenda:
Now that we've reasonably proofed things out, and I understand the current state at DCP, I'd like to revisit the original proposal (daisy chaining ESRI -> OpenData). If we still think that's the best option, then let's go ahead and implement a Github Action to sync data to OpenData.
However, I wonder about a different option to simplify the flow of data, namely pushing from SDE back to Digital Ocean, then fanning out to OpenData and ESRI at the same time. From the perspective of Data, Metadata, and versioning, this seems ideal. But as I understand it, interaction with SDE/arcpy is somewhat auto-magical. (To Simplify: Should we Fan-Out, or Daisy-Chain?) Also to consider: There's some time pressure from OTI to automate things. So that might inform our decision about what's a first good step.
What I mean by "Fan Out"
Notes from DE <> GIS meeting (10/23/2023)
Question: Can we implement the fanout diagrammed above?
Next Steps:
Notes / Questions:
Adding two diagrams of GIS open data process: Original:
More recent: when @jackrosacker returns he can add the other diagram.
@alexrichey FYI - Alex from OTI open data team shared the following resources:
@alexrichey Here's the updated diagram. I did my best to include bits from the whiteboard yesterday, and synthesize some of our thoughts on "phases". Looking forward to hearing your thoughts once we're both back in the office.
Next Steps [As of Nov 20]:
OpenData
Next Steps:
Questions:
@alexrichey to answer the question, Is there a comprehensive list of DCP datasets on Socrata? - unfortunately its not that straightforward since there is not a one-to-one match of all data on BYTES of the BIG APPLE (BOBA) and NYC Open Data portal.
Next Steps
edm-distributions
This phase covers scoping, exploration, and trying discussed approaches on a couple of datasets (likely to be MIH and COLP).
Glossary
packaging
- data and metadata transformations that happen prior to push to other data stores (e.g. Socrata)flow
- a movement of data. It could be a push or a pull.metadata
-Legislative Compliance
section, or contact info)Tasks