NYCPlanning / ae-data-flow

Data pipelines to populate Application Engineering databases
1 stars 0 forks source link

Load sources and tax_lot with explicit geometries #16

Closed pratishta closed 3 months ago

pratishta commented 3 months ago

Part of https://github.com/NYCPlanning/ae-data-flow/issues/4

This PR does a few things:

  1. Give both data sources explicit geometry with an SRID in sql/load_sources.sql
    • pluto.csv's geometries look like li_ft "MULTIPOLYGON (((980898.728373699 191409.779249711... i.e. SRID=2263
    • zoning_districts.csv geometries look like wgs84 "MULTIPOLYGON (((-74.0075330216537 40.6277222566588... i.e. SRID=4326
  2. Update tax_lot table to import from source_pluto and transform geometries where necessary
  3. Remove BEGIN AND COMMIT from import_table.sql and update the README

This should allow you to load the source data, create the tables, and populate the tables (using the commands from updated README)