cloudnativegeo / mlrd

Machine Learning Ready Data

How to package MLRD? #1

Open kbgg opened 1 year ago

kbgg commented 1 year ago

The idea for packaging MLRD training data is to have a STAC catalog containing a collection of STAC items, each representing one label chip. An asset in the label STAC item points to the actual label vector or raster file, and there are two options for linking the source imagery:

  1. Link to each source imagery STAC item from a STAC API that covers the spatial and temporal extent of the label item
  2. In the properties, define the APIs and collections to query for source imagery items matching the spatial and temporal extent of the label item and have the clients run the query and dynamically fetch the STAC items

I'm of the opinion that the second option would provide more compact catalogs and require less effort for STAC catalog generation.
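
As a rough illustration of the second option, here is a minimal Python sketch of how a client might turn a label item into STAC API search requests. The property name `mlrd:source_queries`, its `api`/`collections` layout, the example URL, and the collection ID are all assumptions made for illustration; nothing in this thread defines them. The request body fields (`collections`, `bbox`, `datetime`) are the standard STAC API item search parameters.

```python
from typing import Any

# Hypothetical property name: the thread does not define one.
SOURCES_KEY = "mlrd:source_queries"

# A trimmed-down label item. The URL and collection ID are placeholders.
label_item: dict[str, Any] = {
    "type": "Feature",
    "stac_version": "1.0.0",
    "id": "label-chip-0001",
    "bbox": [-122.5, 37.7, -122.4, 37.8],
    "geometry": {
        "type": "Polygon",
        "coordinates": [[
            [-122.5, 37.7], [-122.4, 37.7], [-122.4, 37.8],
            [-122.5, 37.8], [-122.5, 37.7],
        ]],
    },
    "properties": {
        "start_datetime": "2023-06-01T00:00:00Z",
        "end_datetime": "2023-06-30T23:59:59Z",
        SOURCES_KEY: [
            {
                "api": "https://stac.example.com/v1",
                "collections": ["example-imagery"],
            },
        ],
    },
    "assets": {
        "labels": {"href": "./labels.geojson", "type": "application/geo+json"},
    },
    "links": [],
}


def build_source_searches(item: dict[str, Any]) -> list[tuple[str, dict[str, Any]]]:
    """Return (search_url, request_body) pairs for the item's source imagery.

    Each body reuses the label item's bbox and time range, so the client
    fetches only imagery overlapping the label chip.
    """
    props = item["properties"]
    # Fall back to the single `datetime` when no explicit range is given.
    start = props.get("start_datetime") or props["datetime"]
    end = props.get("end_datetime") or props["datetime"]

    searches = []
    for source in props.get(SOURCES_KEY, []):
        url = source["api"].rstrip("/") + "/search"
        body = {
            "collections": source["collections"],
            "bbox": item["bbox"],
            "datetime": f"{start}/{end}",
        }
        searches.append((url, body))
    return searches


searches = build_source_searches(label_item)
```

Each returned pair could then be sent as a POST request to the API's search endpoint. The catalog itself stores only the query definition, which is where the compactness argument comes from; the trade-off is that results depend on what the API returns at query time.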

doug-newman-nasa commented 1 year ago

Beth is interested! Doug will ride shotgun.

doug-newman-nasa commented 1 year ago

Is an MLRD set of assets a separate collection or part of an existing one?