microbiomedata / nmdc-metadata

Managing metadata and policy around metadata in NMDC
https://microbiomedata.github.io/nmdc-schema/

S4G2 - Metadata ingest, Register new metadata with NMDC runtime #376

Closed ssarrafan closed 3 years ago

ssarrafan commented 3 years ago

This is step 5 of Goal 2 for the July sprint.

wdduncan commented 3 years ago

Pasting link to goals document: https://docs.google.com/document/d/1iBNXkBn24ZkmJkeptoqpyjcz5PAQU59ZjMOzDObnx4E/edit?ts=60d55721

Here is goal 2 step 5 from the document:

New unvalidated workflow metadata is available
- Validation against the schema (Donny)
- Ingest into Mongo (Donny)
- Update NMDC runtime with new valid data available (Donny)
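
The three steps above (validate, ingest, report what is new) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the NMDC runtime's actual code: `REQUIRED_FIELDS`, `validate`, `InMemoryStore`, and `ingest` are hypothetical names, the required-field check stands in for real validation against the NMDC schema, and the in-memory store stands in for a Mongo collection.

```python
from dataclasses import dataclass, field

# Hypothetical required fields; the real NMDC schema defines its own constraints.
REQUIRED_FIELDS = ("id", "type")


def validate(doc):
    """Return a list of problems; an empty list means the toy check passed."""
    return [f"missing field: {name}" for name in REQUIRED_FIELDS if name not in doc]


@dataclass
class InMemoryStore:
    """Stand-in for a Mongo collection, keyed by document id (assumption)."""
    docs: dict = field(default_factory=dict)

    def upsert(self, doc):
        # Insert the document, or overwrite an existing one with the same id.
        self.docs[doc["id"]] = doc


def ingest(candidates, store):
    """Validate each candidate, store the valid ones, and report the outcome.

    Returns (accepted_ids, rejected), where rejected pairs each invalid
    document with its list of validation problems.
    """
    accepted, rejected = [], []
    for doc in candidates:
        errors = validate(doc)
        if errors:
            rejected.append((doc, errors))
        else:
            store.upsert(doc)
            accepted.append(doc["id"])
    return accepted, rejected
```

In this sketch the list of accepted ids is what a caller would pass on to the runtime to announce that new valid data is available. Rejected documents are returned with their problems rather than silently dropped.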

ssarrafan commented 3 years ago

Moving to the August sprint: step 4, which precedes this, is not done and has also been moved to the August sprint.

dehays commented 3 years ago

I believe this issue relates to including metadata for the SPRUCE SFA study / biosamples / sequencing projects from GOLD and the biosamples and instrument processes from EMSL.

  1. The study and biosamples have already been included from GOLD. The sequencing projects still need to be included (from either the GOLD API or the GOLD DB dump).
  2. Need to determine who is responsible for creating the data object for the raw data file output by the instrument process.
  3. For EMSL samples and instrument runs, need to track down the information that Sam compiled. Montana may be able to help with matching EMSL samples to GOLD biosamples and with identifying EMSL-only samples and their metadata.

ssarrafan commented 3 years ago

@wdduncan @dehays can this issue be closed?

ssarrafan commented 3 years ago

Closing per Slack message from David.