MAAP-Project / maap-hec-aws

GEDI-Subsetter for Stage-In Caching #144

Closed: jjfrench closed this issue 1 year ago

jjfrench commented 1 year ago

Issue Mapping

#349

Description

For the stage-in caching approach, files are cached before being passed to GEDI-Subsetting. This means that the CMR query the GEDI-Subsetter performs based on the AOI input needs to happen before the GEDI-Subsetter code runs.

Create a new branch in the GEDI-Subsetter repo where an algorithm input takes a list of files to be subsetted instead of querying CMR (the CMR query section of the code can be removed).

Acceptance Criteria:

Checklist:

jjfrench commented 1 year ago

Completed a version of GEDI-Subsetting where a file passed as an algorithm input, containing one h5 file path per line, is used to determine which granules to subset instead of querying CMR to gather the list.
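
For illustration, a minimal sketch of how a one-path-per-line input file could be read. This is not the code on the hec branch: the function name `read_granule_paths` is hypothetical, and skipping blank lines is an assumption.

```python
from pathlib import Path
from typing import List


def read_granule_paths(listing: Path) -> List[Path]:
    """Return the granule paths listed in `listing`, one path per line."""
    paths = []
    for line in listing.read_text().splitlines():
        entry = line.strip()
        if entry:  # assumption: blank lines are ignored
            paths.append(Path(entry))
    return paths
```

The returned list would take the place of the granule list that the CMR query previously produced.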

Tested by downloading granules locally, removing the CMR query code, adding the --granules algorithm input, and then passing in the input file containing paths to the local granules to create the subsetted gpkg.

Passed off relevant information to Max for feedback

repo url: https://repo.ops.maap-project.org/data-team/gedi-subsetter.git
branch: hec

L4A Example Inputs

--aoi "path/to/aoi.geojson"
--beams "power"
--columns "agbd, agbd_se, sensitivity"
--query "l2_quality_flag == 1 and l4_quality_flag == 1 and sensitivity > 0.95"
--lat "lat_lowestmode"
--lon "lon_lowestmode"
--granules "path/to/granules/file.txt"
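
As a rough illustration of how the inputs above fit together, here is a hypothetical argparse sketch. It is not the subsetter's actual command-line interface: the flag names mirror the example inputs, while the `parse_args` function, the help strings, marking every flag as required, and splitting `--columns` on commas are all assumptions.

```python
import argparse
from typing import List, Optional


def split_columns(value: str) -> List[str]:
    """Split a comma-separated column string and trim whitespace."""
    return [col.strip() for col in value.split(",") if col.strip()]


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    # Hypothetical parser; the real algorithm inputs may be defined differently.
    parser = argparse.ArgumentParser(description="Sketch of GEDI subsetting inputs")
    parser.add_argument("--aoi", required=True, help="path to the AOI file")
    parser.add_argument("--beams", required=True, help="beam selection")
    parser.add_argument("--columns", required=True, type=split_columns)
    parser.add_argument("--query", required=True, help="row filter expression")
    parser.add_argument("--lat", required=True, help="latitude column name")
    parser.add_argument("--lon", required=True, help="longitude column name")
    parser.add_argument("--granules", required=True,
                        help="text file with one h5 path per line")
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse_args([
        "--aoi", "path/to/aoi.geojson",
        "--beams", "power",
        "--columns", "agbd, agbd_se, sensitivity",
        "--query", "l2_quality_flag == 1 and l4_quality_flag == 1 and sensitivity > 0.95",
        "--lat", "lat_lowestmode",
        "--lon", "lon_lowestmode",
        "--granules", "path/to/granules/file.txt",
    ])
    print(args.columns)  # ['agbd', 'agbd_se', 'sensitivity']
```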