bpfree / stellwagen_cable

Cable routing analysis for Stellwagen National Marine Sanctuary
0 stars 0 forks source link

Stellwagen Cable Routing

Cable routing analysis for Stellwagen National Marine Sanctuary

Point of contact

Repository Structure

Note for PC users: The code was written on a Mac so to run the scripts replace "/" in the pathnames for directories with two "\".

Please contact Brian Free (brian.free@noaa.gov) with any questions regarding the code.

Study region

Data sources

Layer Data Source Download Link Metadata Notes
Ocean disposal sites NOAA / OCM Ocean disposal sites Metadata MarineCadastre
Intertidal flats Massachusetts CZM Intertidal Flats Metadata RESTService
Preliminary offshore sand resources APTIM Technical Report No. 631226219 (2018) Massachusetts CZM Sand patches Metadata RESTService
National Channel Framework - Channel Area (ACOE) CZM MORIS Channel areas Metadata RESTService
Anchorages NOAA / OCM Anchorage areas Metadata MarineCadastre
Habitat Northeast Ocean Data Eelgrass meadows Metadata ---------------
Cable and pipelines --------------- Cable and pipelines Metadata ---------------
Submarine cables --------------- Submarine cables Metadata ---------------
Neptune LNG Pipeline Massachusetts CZM Liquid natural gas Metadata RESTService
Northeast Gateway LNG Pipeline Massachusetts CZM Liquid natural gas Metadata RESTService
Algonquin Hubline LNC Pipeline Massachusetts CZM Liquid natural gas Metadata RESTService
CONMAPSG USGS CONMAPSG Metadata (text) FAQ, Data provided for the analysis, by Brooke Hodge, are more expansive than this dataset (report)
Shipping lanes NOAA Boundary Metadata Northeast Ocean Data portal
Seascape derivatives --------------- Slope ---------- ---------------
Stellwagen National Marine Sanctuary NOAA Sanctuaries Boundary Map

Data for the gravel sediment came from Michael Thompson. As stated from him: "The sediment layer was derived collaboratively with a sanctuary led working group (USGS/NOAA Fisheries/NEFMC/UConn/UMass/BU/MA CZM & Stellwagen researchers) that created the 3 class breaks from the multibeam backscatter data with ground truthing and was well vetted for use in the NEFMC habitat amendment works (including their SASI model). The boulder ridges were a polygon layer provided by USGS which were rasterized and added to the sediment raster to give it a 4th value (1-mud, 2-sand, 3-gravel, 4-boulder)."

Value Score Classification
1 - 75 1 Mud
76 - 165 2 Sand
166 - 255 3 Gravel
NA 4 Boulder

Data for the Stellwagen Critical Sand Lance Habitat polygon additional were shared by Michael Thompson for inclusion into the model's third iteration. These data were shared since this dataset "covers most major areas but not all of the sand lance that are fundamental to the sanctuary."

The slope data required opening the data in ArcGIS Pro (version 3.0.2) and exporting the raster data from the geodatabase to usable formats in R (.GRID and .TIFF). Before a linear regression was applied, only slope values within the study region got extracted; this kept the rescaled values between the minimum and maximum slope values possible within the study area.

Methods

Costs

Layer Buffer distance (m) Score Notes
Sediment (CONMAPSG) - Sand/Mud -------------- 0.2 In a second iteration score reduced to 0.1
Sediment (CONMAPSG) - Mix -------------- 0.4 In a second iteration score reduced to 0.1
Sediment (CONMAPSG) - Gravel -------------- 0.5 In a second iteration score reduced to 0.1
Sediment (CONMAPSG) - Rock -------------- 1.0 In a second iteration score reduced to 0.9, a third further dropped the score to 0.6
Active and Inactive Disposal Sites 500 0.8 ---------------
Intertidal Flats 500 0.2 ---------------
Sand Patches 500 1.0 ---------------
Channel Areas 600 1.0 ---------------
Anchorage Areas 600 0.8 ---------------
Eelgrass Meadows 675 0.4 ---------------
Cable and Pipeline Areas 675 0.4 ---------------
Submarine Cables 675 0.4 ---------------
LNG Pipelines 675 0.4 ---------------
Mud -------------- 0.4 In the third iteration, this dataset got added
Sand -------------- 0.6 In the third iteration, this dataset got added
Gravel -------------- 0.5 ---------------
Boulder -------------- 0.6 In the third iteration, this dataset got added
Boulder Ridges 100 0.9 Moved from barrier to cost in last two iterations
Slope -------------- Linear regression ---------------

The CONMAPSG sediment data had the Stellwagen National Marine Sanctuary boundary removed; thus no data existed for the entirety of the national marine sanctuary.

Barriers

Layer Buffer distance (m) Notes
Coral points 675 Provided by MA CZM, not included in later iterations as a barrier
Sites to avoid 500 Provided by NMS
Boulder ridges 500 Provided by NMS, moved from barrier to cost in last two iterations of model
Cape Cod shore 500 Generated by MA CZM

Three different barrier rasters got made:

  1. One with all 4 datasets (coral points, sites to avoid, boulder ridges, and Cape Cod shoreline)
  2. One with only 3 datasets (sites to avoid, boulder ridges, and Cape Cod shoreline) -- coral points were removed
  3. One with only 2 datasets (sites to avoid and Cape Cod shoreline) -- coral points and boulder ridges were removed

Points of connection

This analysis used the same ending points for all cable routing scenarios. Starting points got split into five separate options:

Stellwagen

Stellwagen National Marine Sanctuary (data and map) got split into two sections to determine how cable routing would get affected if limited to a northern or a southern route. The regions got split around the "traffic separation schemes" that influence vessel traffic into and out of Boston Harbor. Three separation schemes split were removed from the study area and left north and southern sections. Each section became barriers to the cable routing analysis; thus each got separately removed from the Stellwagen National Marine Sanctuary to create two new cost rasters.

Least cost path

This analysis first leveraged the "Least Cost Path" toolkit in Esri's ArcGIS Pro version 3.3. The tool works by summing all the "costs" moving from some identified starting point(s) to other identified ending (points). The costs are the sum of data layers overlapping in a predefined study region to create the cost surface -- think of this as a board where moving a piece is more challenging if certain areas are a forest, mountain, or sand, so going a slightly longer way on an easier path or a short way through a challenging part will be the take less effort.

When a path with a set width was chosen as the desired output, the analysis used the "optimal corridor connections" toolkit in ArcGIS Pro version 3.3. Its output is a corridor of a pre-determined width (500m around the points to create a 1000m-wide corridor). For the tool to work, two actions had to get taken. First, a new feature class (a geodatabase got created and populated with template features for the four paths (each lease to Boston or Plymouth)) got [created](https://pro.arcgis.com/en/pro-app/latest/help/data/feature-classes/create-a-feature-class.htm. Geopackages in QGIS have the ability to exist in 64-bit, but ArcGIS cannot handle 64-bit data. So new features classes got created to exist in 32-bit (set OIDType to equal 32-bit and keep the coordinate reference system the same as the original data (NAD83 19N)). The original data can then get appended the start-end points for each path to their respective newly created feature class within the geodatabase. Each path had its own corridor run, as the tool when multiple starting and ending points are given, creates corridors in a particular order instead of the desired one starting location to the two end locations and then for the next lease starting location.

Four scenarios got investigated in this analysis for each starting option:

  1. Normal barriers -- all data layers with costs got summarized to a simple cost raster that had the 4 barriers removed (coral points, sites to avoid, boulder ridges, and Cape Cod shore) -- then 2 barriers (sites to avoid, Cape Cod shore)
  2. North barrier -- the normal scenario with an additional area removed that was the southern section of Stellwagen National Marine Sanctuary (as defined by as part of the marine sanctuary that fell south of the traffic separation schemes from the shipping lanes dataset)
  3. South barrier -- the normal scenario with an additional area removed that was the northern section of Stellwagen National Marine Sanctuary (as defined by as part of the marine sanctuary that fell north of the traffic separation schemes from the shipping lanes dataset)
  4. TSS -- the normal scenario with both the northern and southern sections of Stellwagen National Marine Sanctuary (as defined by as parts of the marine sanctuary that fell north and south of the traffic separation schemes from the shipping lanes dataset)

It should get noted that the leastcostpath package in R had been investigated as a possible option, but it was ruled out as a viable option till more work can get done (GitHub repository, CRAN, reference manual).

Least cost path

Model iterations:

2 October 2024 Iteration Starting Point Ending Point Costs Barriers Extent
Iteration 1 0564 & 0567 (edges) Boston & Plymouth Normal No coral points Full
Iteration 2 0564 & 0567 (edges) Boston & Plymouth Normal No coral points North
Iteration 3 0564 & 0567 (edges) Boston & Plymouth Normal No coral points South
Iteration 4 0564 & 0567 (edges) Boston & Plymouth CONMAPSG update No coral points Full
22 October 2024 Iteration Starting Point Ending Point Costs Barriers Extent
Iteration 1 0564 & 0567 (edges) Boston & Plymouth (1000m separation) Sediment updates No coral points nor boulder ridges Full
Iteration 2 0564 & 0567 (edges) Boston & Plymouth (1000m separation) Sediment updates No coral points nor boulder ridges North
Iteration 3 0564 & 0567 (edges) Boston & Plymouth (1000m separation) Sediment updates No coral points nor boulder ridges South
Iteration 4 0564 & 0567 (edges) Boston & Plymouth (1000m separation) Sediment updates No coral points nor boulder ridges TSS
Documented issues

7 August 2024

  1. What is happening with the coral points data? Data source is unknown and data shared seem to have a 175-meter buffer already applied, yet the analysis details a 675-meter buffer. [Update on 12 August 2024: per Brooke Hodge's response on the data source being from NOAA's deep-sea coral dataset and supplemented from another data source, strangely there are instance where some data from the deep-sea coral dataset do not get included in the previously shared coral point data, but also there exist data in the dataset that do not appear in the previously shared data.

Put simply, the previously shared data lack some of the deep-sea coral dataset provided by NOAA but also have additional supplemental data. How are these two conditions reconciled?]

8 August 2024

  1. Submarine cable area data by Northeast Ocean Data differ than the data provided by MarineCadastre. Northeast Ocean Data provided data got integrated due to past use in previous model iterations. However, the metadata document suggest they are originally from ENC Direct (see map services). When comparing data between the two providers there exist differences and discrepancies. The document does not detail enough why these discrepancies exist or how to reconcile them.
  2. Northeast Ocean Data cable and pipeline data differ from data hosted on MarineCadastre. The metadata document suggests the Northeast Ocean Data data got created by combining ENC Direct data at the approach and harbor data for the submarine cable areas and the pipelines. Still unclear why and how differences exist across the datasets.
  3. What is happening with the "G or R" data within the CONMAPSG dataset? It did not get a classification? Neither has "R". Should "G or R" get called gravel or both classified as rock? [Update 12 August 2024: per Brooke Hodge, both can get classified as "rock".]
  4. Previous analysis used a active and inactive disposal sites dataset seems to have one fewer objects than the current available dataset. Are the previous data just old? Something off about the new data? [Update on 12 August 2024: per Brooke Hodge the additional area is a temporary extension, but the analysis can get run with the currently available data.]

9 August 2024

  1. Past analysis had a coordinate reference system that has units in US survey foot. (NAD83 UTM 19N). According to NIST, US survey feet one should avoid using US survey foot units. This seems to make instinctive sense to me given all the setbacks are in meters not imperial units. [Update 7 August 2024: per Daniel Sampson the preferred coordinate reference system is NAD83 with a EPSG code of 26986; Update 12 August 2024: per Brooke Hodge, a coordinate system for UTM19N would likely be fine to use]
  2. No active nor inactive disposal sites were located within the study region. The previous analysis seems to not have limited to only the active nor inactive disposal sites, but all disposal sites. On the Northeast Ocean Data portal for "Inactive" and "Active" ocean disposal sites the status are recorded as those: "inactive" and "active". The downloaded data, however, from Marine Cadastre have additional statuses: "approved", "discontinued", and "unknown". Some of the statuses in the data from Northeast Ocean Data portal labeled as "inactive" are qualified as "Discontinued" in the Marine Cadastre dataset. How have "approved", "discontinued", and "unknown" been relabeled as "active" and "inactive"?

3 September 2024

  1. Shoreline limit for Cape Cod seems to come from the USGS global islands dataset. Is this dataset preferable to the Massachusetts Coastal Zone Management dataset on shorelines between 1800 - 2014? The USGS global islands dataset got published in 2019 -- data were from 2014 as well. Which is the more authoritative data source?
  2. For the sites to avoid, most of the entries are wrecks. Publically available datasets for shipwrecks include: AWOIS Wrecks, ENC Wrecks, and AWOIS Obstructions. Other data included in the sites to avoid have descriptions of: "Boulder In", "Mud In", "Sand In", "Gravel In", "Sandtrap", and "NoiseReference". Cannot get determined the provider of the latter data.
  3. What was the provider of the boulder ridges data?

11 September 2024

  1. How should cells that do not have any data from a particular dataset get analyzed? It makes the most sense to give value of 0 as there is no cost for that particular location for that particular dataset.
  2. How were start and end points chosen? Any particular coordinates or just randomized?
  3. Which data did the analysis to split Stellwagen National Marine Sanctuary into north and southern regions? [Update 11 September 2024: per Alyssa Randall the data came from traffic lane layer on Northeast Ocean Data portal. Those data direct you to data hosted on ENC Direct.]

1 October 2024

  1. Should the CONMAPSG gravel data and NMS sediment get combined or remain separate? [Update 1 October 2024: per Alyssa Randall the data are remaining separate]
  2. Should the CONMAPSG gravel data get a value of 0.5 from the present 0.6? [Update 1 October 2024: per Alyssa Randall the data will get a value of 0.5 to keep consistent with the suggestion by NMS]

21 October 2024

  1. Should there exist two submarine cable datasets or get different values? [Update 21 October 2024: per Brooke Hodge and Michael Thompson only a single dataset gets integrated; Sanctuaries did not have dataset, so keeping value as requested by Massachusetts, which is 0.4]

23 October 2024

  1. Alice Stratton and National Marine Sanctuary requested that boulder ridge setback distance get reduced from 500m to 100m and the setback for sites to avoid get reduced from 1000m to 500m. How a 1000m distance was previously chosen remains unclear.