Closed robyngit closed 2 years ago
It looks like the footprints of the source satellite imagery have already been recorded, along with other metadata about the imagery, at least for a portion of the data.
For example, here is a plot of the file pdg/data/Arctic-Imagery/high_ice/4519_1May2021/Chandi_Alaska_Imagery_2021apr22.shp
from the datateam server:
Along with an example of some of the metadata available:
A file like this is exactly what we need to 1) Identify the footprint for a given file (since the geometry of the footprint is matched to the shapefile with the s_filename
property); and 2) rank files according to preference (we could use one of the properties in the footprint file, e.g. newest acq_time
)
After inspecting this file, I think it might be worthwhile to remove some of the duplicate files before we run everything through the viz pipeline for the first time. Here is a close up of the footprint file where there are > 10 files overlapping:
Chandi_Alaska_Imagery_2021apr22.shp
file don't exactly match the current version of the IWP shapefiles. It looks like maybe the original files have been clipped. Here are a few examples where each overlapping file is in red and blue, and the boundaries from the Chandi_Alaska_Imagery_2021apr22.shp
file are shown in green:
To make a spatially consistent product, we would like a deduplication method that keeps polygons from only one file in areas where two or more files overlap. In these areas of file overlap, we should remove all polygons that are not from the preferred file (e.g. the newest file.)
We should think about the following when coming up with a solution:
Date
in the Ice Wedge Polygons case), or we would need to calculate the mean/max/min/etc. of the property for all polygons within the file and compare. (And in that case, should we compare across the entire file, within tiles, or within areas of overlap?)