PermafrostDiscoveryGateway / viz-staging

PDG Visualization staging pipeline
Apache License 2.0
2 stars 1 forks source link

Identify and remove duplicate polygons that exist in 2+ input vector files #1

Closed robyngit closed 2 years ago

robyngit commented 2 years ago

Input vector files sometimes comprise overlapping areas where the same geographic feature is represented in each file, as is the case with the Ice Wedge Polygons. Where this overlap occurs, the resulting vector tiles contain duplicated polygons, which result in artificially inflated polygon counts and area measurements in the viz-raster steps.

To handle overlap, we decided to remove the duplicate polygons where the distance between centroids is smaller than some threshold. Specifically:

It was decided that 5 meters is likely a good threshold e for the Ice Wedge Polygons

robyngit commented 2 years ago

All deduplication options are documented here:

https://github.com/PermafrostDiscoveryGateway/viz-staging/blob/b6f787963b6299602ecfec7d844e4f385e8c4436/pdgstaging/ConfigManager.py#L169-L213