microbiomedata / nmdc_automation

Prototype automation
3 stars 2 forks source link

Update gold:Gs0110138 / nmdc:sty-11-33fbta56 SPRUCE (metagenomes, NOM, proteomics, metatranscritpomes) #167

Closed mbthornton-lbl closed 1 month ago

mbthornton-lbl commented 1 month ago

Omics Type(s): metagenomes, metatranscriptomes, proteomics, NOM

Steps: Missing Data

Note: omics_processing was not loaded during testing, so we will need to re-run extract-records and process-records below QC data addition

Metagenomics - Note filesystem updates for SPRUCE are here: #175

Proteomics:

mbthornton-lbl commented 1 month ago

ingest-records failsL

ERROR:root:An error occurred - aborting transaction
ERROR:root:An error occurred while ingesting records: 500 Server Error: Internal Server Error for url: https://api-napa.microbiomedata.org/metadata/json:validate
Traceback (most recent call last):
  File "/Users/MBThornton/Documents/code/nmdc_automation/nmdc_automation/re_iding/scripts/./re_id_tool.py", line 731, in ingest_records
    _ingest_records(db_records, db_client, api_user_client)
  File "/Users/MBThornton/Documents/code/nmdc_automation/nmdc_automation/re_iding/scripts/./re_id_tool.py", line 757, in _ingest_records
    if api_user_client.validate_record(record):
  File "/Users/MBThornton/Documents/code/nmdc_automation/nmdc_automation/api/nmdcapi.py", line 410, in validate_record
    response.raise_for_status()
  File "/Users/MBThornton/Library/Caches/pypoetry/virtualenvs/nmdc-automation-VEpwcKpc-py3.9/lib/python3.9/site-packages/requests/models.py", line 1021, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 500 Server Error: Internal Server Error for url: https://api-napa.microbiomedata.org/metadata/json:validate
INFO:root:Elapsed time: 3.8852598667144775
ssarrafan commented 1 month ago

Sprint over, removing from sprint.

mbthornton-lbl commented 1 month ago

Data Prerequisites for SPRUCE

nmdc:sty-11-33fbta56_biosample_set.json nmdc:sty-11-33fbta56_omics_processing_set.json nmdc:sty-11-33fbta56_updated_record_identifiers.txt

@aclum These are the data docs from the automation repository napa_reid_data_and_logs branch, the same as were used in the first, failed reid attempt. I am assuming we will be using these for a second run - could you please have a look at these and confirm?

mbthornton-lbl commented 1 month ago

nmdc:sty-11-33fbta56_prot_data_objects.json

aclum commented 1 month ago

Okay to close :)