NASA-IMPACT / csdap-cumulus

SmallSat Cumulus Deployment
Other
1 stars 0 forks source link

Change ingest Batch size from 5 back to 1 #223

Closed krisstanton closed 1 year ago

krisstanton commented 1 year ago

During ingest, granules are processed in batches of 5. If only one of those is missing the CMR file, it would prevent the ingest of the other 4. This ticket is just changing the setting from 5 to 1 for the next collections to ingest.

krisstanton commented 1 year ago

Applied Changes to PROD

➜  csdap-cumulus git:(iss105__ChangeBatchSizeTo1) ✗ DOTENV=.env.prod make bash
sha256:-----
WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested
WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested
(csdap-cumulus-prod-5982:prod):/work $ cumulus collections replace --data app/stacks/cumulus/resources/collections/WV01_Pan_L1B___1.json
{
  "createdAt": 1682720477267,
  "updatedAt": 1691005455987,
  "name": "WV01_Pan_L1B",
  "version": "1",
  "duplicateHandling": "replace",
  "granuleId": ".*",
  "granuleIdExtraction": "^(WV01_.+-P1BS-.+_P\\d{3}).+$",
  "sampleFileName": "WV01_20071231044012_1020010001E9B500_07DEC31044012-P1BS-505051421030_01_P024-BROWSE.jpg",
  "url_path": "{cmrMetadata.CollectionReference.ShortName}___{cmrMetadata.CollectionReference.Version}/{dateFormat(cmrMetadata.TemporalExtent.SingleDateTime, YYYY)}/{dateFormat(cmrMetadata.TemporalExtent.SingleDateTime, DDDD)}/{cmrMetadata.GranuleUR}",
  "meta": {
    "preferredQueueBatchSize": 1
  },
  "ignoreFilesConfigForDiscovery": false,
  "files": [
    {
      "regex": ".*-thumb[.]jpg$",
      "sampleFileName": "WV01_20071231044012_1020010001E9B500_07DEC31044012-P1BS-505051421030_01_P024-thumb.jpg",
      "type": "browse",
      "bucket": "public",
      "reportToEms": true
    },
    {
      "regex": ".*-BROWSE[.]jpg$",
      "sampleFileName": "WV01_20071231044012_1020010001E9B500_07DEC31044012-P1BS-505051421030_01_P024-BROWSE.jpg",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*-cmr[.]json$",
      "sampleFileName": "WV01_20071231044012_1020010001E9B500_07DEC31044012-P1BS-505051421030_01_P024-cmr.json",
      "type": "metadata",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*[.]tar$",
      "sampleFileName": "WV01_20071231044012_1020010001E9B500_07DEC31044012-P1BS-505051421030_01_P024.tar",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*[.]ntf$",
      "sampleFileName": "WV01_20071231044012_1020010001E9B500_07DEC31044012-P1BS-505051421030_01_P024.ntf",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*[.]xml$",
      "sampleFileName": "WV01_20071231044012_1020010001E9B500_07DEC31044012-P1BS-505051421030_01_P024.xml",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    }
  ],
  "reportToEms": true
}
(csdap-cumulus-prod-5982:prod):/work $ cumulus collections replace --data app/stacks/cumulus/resources/collections/WV04_Pan_L1B___1.json
{
  "createdAt": 1685547018555,
  "updatedAt": 1691005469324,
  "name": "WV04_Pan_L1B",
  "version": "1",
  "duplicateHandling": "replace",
  "granuleId": ".*",
  "granuleIdExtraction": "^(WV04_.+-P1BS-.+_P\\d{3}).+$",
  "sampleFileName": "WV04_20170505053805_594e4035-a94b-4da5-a5a2-9593702d7739-inv_17MAY05053805-P1BS-059102583030_01_P001-BROWSE.jpg",
  "url_path": "{cmrMetadata.CollectionReference.ShortName}___{cmrMetadata.CollectionReference.Version}/{dateFormat(cmrMetadata.TemporalExtent.SingleDateTime, YYYY)}/{dateFormat(cmrMetadata.TemporalExtent.SingleDateTime, DDDD)}/{cmrMetadata.GranuleUR}",
  "meta": {
    "preferredQueueBatchSize": 1
  },
  "ignoreFilesConfigForDiscovery": false,
  "files": [
    {
      "regex": ".*-thumb[.]jpg$",
      "sampleFileName": "WV04_20170505053805_594e4035-a94b-4da5-a5a2-9593702d7739-inv_17MAY05053805-P1BS-059102583030_01_P001-thumb.jpg",
      "type": "browse",
      "bucket": "public",
      "reportToEms": true
    },
    {
      "regex": ".*-BROWSE[.]jpg$",
      "sampleFileName": "WV04_20170505053805_594e4035-a94b-4da5-a5a2-9593702d7739-inv_17MAY05053805-P1BS-059102583030_01_P001-BROWSE.jpg",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*-cmr[.]json$",
      "sampleFileName": "WV04_20170505053805_594e4035-a94b-4da5-a5a2-9593702d7739-inv_17MAY05053805-P1BS-059102583030_01_P001-cmr.json",
      "type": "metadata",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*[.]tar$",
      "sampleFileName": "WV04_20170505053805_594e4035-a94b-4da5-a5a2-9593702d7739-inv_17MAY05053805-P1BS-059102583030_01_P001.tar",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*[.]tif$",
      "sampleFileName": "WV04_20170505053805_594e4035-a94b-4da5-a5a2-9593702d7739-inv_17MAY05053805-P1BS-059102583030_01_P001.tif",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    },
    {
      "regex": ".*[.]xml$",
      "sampleFileName": "WV04_20170505053805_594e4035-a94b-4da5-a5a2-9593702d7739-inv_17MAY05053805-P1BS-059102583030_01_P001.xml",
      "type": "data",
      "bucket": "protected",
      "reportToEms": true
    }
  ],
  "reportToEms": true
}
krisstanton commented 1 year ago

The configuration changes have now been merged with main.