PermafrostDiscoveryGateway / pdg-portal

Design and mockup documents for the PDG portal
Apache License 2.0
0 stars 0 forks source link

Show surface water raster layer #17

Open robyngit opened 2 years ago

robyngit commented 2 years ago

Chandi & Rajitha have processed and uploaded the relevant raster images to the NCEAS datateam server. Here are the steps to find the surface water data:

robyngit commented 1 year ago

Notes:

julietcohen commented 5 months ago

Since rearranging the Datateam server, the data is now located at: /var/data/submission/pdg/ice-wedge-polygon-data/version_01/high_ice/

The dates for the subdirectories directories are:

There is also another data directory that contains surface water rasters: /var/data/submission/pdg/surface_water/water_processed/ This directory's date is October 2022, so it is newer than the other directory, and it does not contain subfolders for the Alaska, Canada, and Russia. It contains 155 raster files. It would be interesting to determine if these are the same as in the other directory, and just a newer version.

julietcohen commented 5 months ago

Importantly, the newest version of the IWP data (more recent than version_01) is stored in /var/data/submission/pdg/ice-wedge-polygon-data/version_2023-01-31/ and within that dir we made a subdir to archive all of the files Elias used to pre-process the IWP data (after detection, before passing on to the viz team). This subdir is: pre_processing_materials/cleaning_data and contains shapefiles for surface water:

There is a README file as well, but it is unfortunately not helpful, because it directs the user to https://nsidc.org/data/ggd318.html which does not exist anymore. So understanding metadata for those files might need to come from Elias. Alternatively, there may be a way to determine the new URL for the page that doesn't exist anymore from the National Snow and Ice Data Center.

My best guess is that these two shapefiles should be the input to the viz workflow to produce the surface water layer, because it is the most recent version 🎉

julietcohen commented 4 months ago

Chandi confirmed that the water masks in raster format that came with the older version of the IWP data are not the same as the few vector waster masks that came with the newer version of the IWP dataset. So Chandi suggested that his team should send over a gdal script that converts the rasters to vectors, and that way I can do the conversion for the water mask rasters, and process those output files in the viz workflow that is configured to only ingest vectors at the moment. The team also decided it would be potentially beneficial to integrate that gdal code into the viz-staging package, if we want to pursue to route of converting rasters to vectors for other datasets as well, as an alternative to adding the ability to process rasters as input directly to the workflow.

I do want to investigate the accuracy of converting rasters to vectors, because from my experience vectorizing rasters, the output vector data is not as accurate as the data would be if it was originally in vector format. For example, adjacent polygons in a raster may be grouped into one larger polygon when it is vectorized. Which would result in less accurate statistics for # of polygons in a tile, but would be similar accuracy for % coverage of a tile.