NASA-IMPACT / csdap-cumulus

SmallSat Cumulus Deployment
Other
1 stars 1 forks source link

Ingest Missing WV02_MSI_L1B granules from DB into Production from MCP #377

Open jsrikish opened 3 months ago

jsrikish commented 3 months ago

Ingest granules in collection WV02_MSI_L1B to CBA Prod by discovering/ingesting from MCP account.

Note: Some of the WV02_MSI_L1B granules have already been ingested into cumulus; there were a lot of missing granules which did not have entry in the DB; For those missing granules, checksums were calculated and inserted into DB In the "collections/WV02_MSI_L1B___1_2022.json" file "duplicateHandling": "skip" is set to skip to avoid re-ingesting granules which have been ingested.

Following steps will be repeated for 2022-2009, one year at a time starting with 2022

Acceptance criteria

To determine how many granules have been processed, first enter the Docker container:

DOTENV=.env.cba-prod make bash

In the container, run the following:

DEBUG=1 cumulus granules list -? collectionId=WV02_Pan_L1B___1 --limit=0 -? status=completed

(note: due to a Cumulus bug, sometimes the status does not get properly updated. Try running these to match the numbers)

DEBUG=1 cumulus granules list -? collectionId=WV02_MSI_L1B___1 --limit=0
DEBUG=1 cumulus granules list -? collectionId=WV02_MSI_L1B___1 --limit=0 -? status=queued
DEBUG=1 cumulus granules list -? collectionId=WV02_MSI_L1B___1 --limit=0 -? status=running
DEBUG=1 cumulus granules list -? collectionId=WV02_MSI_L1B___1 --limit=0 -? status=completed
DEBUG=1 cumulus granules list -? collectionId=WV02_MSI_L1B___1 --limit=0 -? status=failed

You should see output similar to the following:

...
RESPONSE: {
  statusCode: 200,
  body: '{"meta":{"name":"cumulus-api","stack":"cumulus-prod","table":"granule","limit":0,"page":1,"count":8592},"results":[]}',
  headers: {
    'x-powered-by': 'Express',
    'access-control-allow-origin': '*',
    'strict-transport-security': 'max-age=31536000; includeSubDomains',
    'content-type': 'application/json; charset=utf-8',
    'content-length': '114',
    etag: 'W/"72-O2wUXhu+Q9J1hqdDrb0fcsZeFHo"',
    date: 'Fri, 01 Dec 2023 21:29:19 GMT',
    connection: 'close'
  },
  isBase64Encoded: false
}
[]

In particular, look at the value for body and within it, locate the value of "count". In the output above, the count should match the Earthdata Search granule count obtained in the very first step.

hbparache commented 3 months ago

Maxar Data Conversion and Cumulus Ingest Tracking

jsrikish commented 3 months ago

Year total # granules in Earthdata previous count 2022 555976 98950 2009 1007