datacommonsorg / data

Apache License 2.0

Ingest Google Earth Engine data on-the-fly #1025

Open alxmrs opened 3 weeks ago

alxmrs commented 3 weeks ago

Given #1024, one potential improvement to Data Commons would be to ingest Google Earth Engine (GEE) data on the fly. Raster data in GEE can be loaded into Xarray via Xee (https://github.com/google/Xee). With the “streaming”, chunked programming model that Dask provides, the raster data can then be transformed into tabular data in a parallel, memory-bounded way using Xarray's Dataset.to_dask_dataframe (https://docs.xarray.dev/en/stable/generated/xarray.Dataset.to_dask_dataframe.html).