dhardestylewis / terrain_aggregator

Workflow to aggregate terrain imagery at scale to a single seamless image dataset
Other
10 stars 4 forks source link

free-form description to incorporate into `README.md` or other publications #75

Open dhardestylewis opened 1 year ago

dhardestylewis commented 1 year ago

free-form description to incorporate into README.md or other publications

Work thus far

With recent advances in the coverage of high quality public domain aerial terrain imagery, civil designers, surface water modellers, and others can conduct a majority of their analyses directly from available terrain imagery before purchasing a more highly detailed land survey, if at all.

It is a common need and common practice today for practitioners to aggregate terrain imagery from a variety of aerial imagery sources, often received as datasets with differing spatial resolutions and map projections, and with differing and sometimes overlapping spatial coverage areas. In order to undertake their engineering analyses, it is usually necessary to use a single terrain dataset with the same spatial resolution and map projection throughout, covering the entire spatial extent of the area under analyses. For each area under analyses, practitioners conduct a similar series of pre-processing steps involving spatial reprojection, mosaicking, and retiling in order to produce a seamless terrain dataset with a single set of characteristics such as projection & resolution.

This is the order to scale these pre-processing steps beyond the local areas they are usually undertaken: 1) separate all available imagery tiles into collections by the following characteristics:

This workflow is conducted on the imagery tile basis. There is an alternative workflow which is conducted on a pixel-by-pixel basis:

The key claim of this work is that spatial inaccuracies introduced by this generalized imagery tile aggregation workflow are minimal enough that the resulting data is as usable as the source data for the vast majority of analyses conducted. This claim relies upon the high quality (<3m) of the most recent generations of Lidar imagery and the most common analyses's tolerance for inaccuracy in the vast majority of their use-cases.

Each step of this aggregation workflow can be divided geographically into a computational grid, with each cell corresponding to a geographic extent that is a subset of the extent of all available imagery. Each cell can be separately computed in parallel on each CPU of each node.

Because the aggregation workflow can be computed independently for separate geographic extents, this same workflow can be applied to update sections of the final single seamless terrain tileset whenever new source data becomes available.

Vision

A web interface will serve this highest-resolution tileset. A separate and completely parallel (1-to-1) set of visualization tiles with reduced quality necessary for visual display over the Internet will be produced.

👉 This interface will include togglable semi-transparent layers describing visually on the map itself for each tile from the single seamless terrain dataset

👉 Another semi-transparent layer will describe the spatial inaccuracy of each pixel relative to its source dataset.

👉 Another semi-transparent layer will be a polygonal depiction of the best available source imagery tilesets at each point.

This interface will allow for search by but not limited to

This web interface will support downloading & receiving feedback for any selections above.

To support downloading these custom selections, aggregations of these tiles will be produced on request (on-the-fly) either for immediate download (if selection is small enough) or for later download by temporarily provisioned link.

A toggle will be provided to support downloading at different spatial resolutions, either fixed or custom. Since the single seamless terrain dataset is a maximally up-sampled terrain dataset representing the best available data at any given geographic location, all other spatial resolutions provided will be down-sampled from this single highest-resolution dataset.

This web interface will support on-the-fly requests to generate digital elevation tilesets for selections above directly from all available source Lidar point clouds, at a specified resolution and using a selected resampling algorithm.