azavea / osmesa

OSMesa is an OpenStreetMap processing stack based on GeoTrellis and Apache Spark
Apache License 2.0
79 stars 26 forks source link

Add facilites for producing ORC files needed for bulk ingest #199

Closed jpolchlo closed 3 years ago

jpolchlo commented 3 years ago

When reconstructing the staging DB, it is necessary to produce an ORC file for the planet history as well as to update a changeset ORC file. The former task was previously left to a manual process. This PR adds automation to help with this, and combines it with the changeset ORC update. Now, from ./deployment/batch/makefile, make prepare-bulk-ingest will create a cluster and add steps for both the planet history ORC conversion and the changeset ORC update. The cluster will terminate automatically after the steps terminate.