Open turbomam opened 7 months ago
below is a list of slots that are in nmdc schema, but are only needed in the submission portal. These slots need to be store on the submission portal side, and able to be exported for the JGI metadata template. But do not need captured by NMDC
@pkalita-lbl @turbomam See the table above.
Next step for these JGI and EMSL slots I propose
The slots would need to be defined in this TSV file: https://github.com/microbiomedata/submission-schema/blob/main/schemasheets/tsv_in/slots.tsv
The association of slots to classes happens in this TSV file: https://github.com/microbiomedata/submission-schema/blob/main/schemasheets/tsv_in/classes.tsv. You can see that a lot of the slots in question are already associated with the appropriate Interface classes. Those would just need to be reviewed to make sure they're still accurate.
From slack
@turbomam "If some UF slots are gong to be moved to submission-schema in the near future, then I would prefer not to move them around within the nmdc-schema modules. I think Alicia and I moved some slots because she was using them outside of their original UF use-case, and that kept the schema from building. We should have included you ion that decison. I'm working on an issue to make all modules self sufficient (ie build on their own), and this will address the question you have asked along the line "how do I know which module to put new content in?" Can we leave the UF slots where they are until then?"
In summary for user facility slots we want to keep in NMDC schema, I'll add alias. In a later task we'll complete this issue, getting the user facility slots that NMDC does NOT need to track removed from NMDC schema, and only have them in NMDC submission portal. We will also later decide if the slots that we DO capture should remain in separate .yaml files or be moved to basic_slots.
@mslarae13 @pkalita-lbl is this an active issue? I'm going to remove this from this sprint and add to the backlog but if it's active please add to a future sprint.
For example,
rna_volume
is defined in nmdc-schema and associated withBiosample
. We even have some tests for it.If it is important to capture that information in the SubmissionPortal, so that it can be passed on to user facilities, but it is never going to be saved into MongoDB, maybe it doesn't belong in nmdc-schema?
Right now, I don't think we have a way to introduce slots into submission-schema other than by extraction from nmdc-schema, but that doesn't seem like a big technical challenge.
see this example data file, which I revised in a currently-unmerged PR.