NASA-IMPACT / csdap-cumulus

SmallSat Cumulus Deployment
Other
1 stars 0 forks source link

Migrate collection GE01_MSI_L1B to CBA Prod #325

Closed krisstanton closed 5 months ago

krisstanton commented 6 months ago

Migrate granules in collection GE01_MSI_L1B to CBA Prod by discovering/ingesting from existing prod account.

Acceptance criteria

To determine how many granules have been processed, first enter the Docker container:

DOTENV=.env.cba-prod make bash

In the container, run the following:

DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=completed

(note: due to a Cumulus bug, sometimes the status does not get properly updated. Try running these to match the numbers)

DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0
DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=queued
DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=running
DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=completed
DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=failed

You should see output similar to the following:

...
RESPONSE: {
  statusCode: 200,
  body: '{"meta":{"name":"cumulus-api","stack":"cumulus-prod","table":"granule","limit":0,"page":1,"count":8592},"results":[]}',
  headers: {
    'x-powered-by': 'Express',
    'access-control-allow-origin': '*',
    'strict-transport-security': 'max-age=31536000; includeSubDomains',
    'content-type': 'application/json; charset=utf-8',
    'content-length': '114',
    etag: 'W/"72-O2wUXhu+Q9J1hqdDrb0fcsZeFHo"',
    date: 'Fri, 01 Dec 2023 21:29:19 GMT',
    connection: 'close'
  },
  isBase64Encoded: false
}
[]

In particular, look at the value for body and within it, locate the value of "count". In the output above, the count should match the Earthdata Search granule count obtained in the very first step.

krisstanton commented 5 months ago

GE01_MSI_L1B has 712,613 Granules

jsrikish commented 5 months ago

Bumping up the message limit and execution limit to 1400 in app/stacks/cumulus/main.tf (Still well within the 2000 that the CMR team wants to limit us to)

krisstanton commented 5 months ago

GE01_MSI_L1B migration ingest is now running!

jsrikish commented 5 months ago

completed on Feb 7, 2024, 00:41:07.925 (UTC-06:00)

Athena Query Results: MissingCmrFile -- 138 TypeError -- 2

Count After running: DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=completed

DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 count:724763

DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=queued count:0

DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=running count:0

DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=completed count:724623

DEBUG=1 cumulus granules list -? collectionId=GE01_MSI_L1B___1 --limit=0 -? status=failed count:140

Earthdata search shows: 724,625 matching granules

chuckwondo commented 5 months ago

Nicely done! I'm wondering if the count difference of 2 between Earthdata Search count and the "completed" status count is from the 2 TypeErrors. If so, then running cumulus dead-letter-archive recover-cumulus-messages might resolve that discrepancy.

jsrikish commented 5 months ago

Issuing cumulus dead-letter-archive recover-cumulus-messages gave the same result