NASA-IMPACT / veda-data-pipelines

data transformation - ingestion - publication pipelines to support VEDA
Other
13 stars 6 forks source link

Data transfer to MCP #145

Closed slesaad closed 2 years ago

slesaad commented 2 years ago

Epic

118

Description

One of the most important tasks in migration of VEDA to MCP is migration of the data files to the MCP s3 bucket.

Based on decisions made in #118, it's been decided that this will be carried out by running the data transformation and ingestion pipeline with a flag set to transfer the data to the MCP s3 bucket.

A role based policy will be used to gain access to the MCP s3 bucket from the ingestion pipeline.

Examples

The current data link points to a UAH bucket. That's where the data files exist.

{
  "assets": {
    "cog_default": {
      "href": "s3://climatedashboard-data/bmhd_30m_monthly_bkp/finalBMHD_ScaledVenice_202203.tif",
      "type": "image/tiff; application=geotiff; profile=cloud-optimized",
      "roles": [
        "data",
        "layer"
      ]
    },
  }
}

At the end of the data migration, the links should look like the following and the data should exist in that link:

{
  "assets": {
    "cog_default": {
      "href": "s3://veda-data-store-staging/nightlights-hd-monthly/finalBMHD_ScaledVenice_202203.tif",
      "type": "image/tiff; application=geotiff; profile=cloud-optimized",
      "roles": [
        "data",
        "layer"
      ]
    },
  }
}

Acceptance Criteria:

Checklist for collections

Checklist:

xhagrg commented 2 years ago

We should probably push to prod bucket rather than staging bucket.

slesaad commented 2 years ago

Misc TODOs

slesaad commented 2 years ago

Some pgstac database quirks ☠️ realised while migrating the datasets that we should be aware of:

abarciauskas-bgse commented 2 years ago

The data products are in s3://veda-data-store-staging and not s3://veda-data-store - @xhagrg @slesaad can we migrate the products to be in s3://veda-data-store? 🙏🏽

abarciauskas-bgse commented 2 years ago

We are having a longer discussion on slack so will follow up with next steps