Open turbomam opened 2 years ago
Funny, I thought there was also a ticket to "add env_package"
It can be inferred, but it's not in the sheet & we need it in the sheet in EMSL because of how we're collecting the information. If there's a way to make it auto-populate, great but the slot shouldn't be totally removed.
I have a ticket with GSC that env_package
should be added back to the MIxS model. Theoretically we're using it off the books now.
Yes, we should be able to share tabular data about biosamples including an env_package
column.
When I convert the biosample metadata collected from DataHarmonizer, so that it can be inserted into MongoDB, the env_package
is merged in. In this case, the env_package
value comes from the study description pages. It can be reported out as a table with the env_package
column during the conversion, or after being loaded into MongoDB.
@turbomam moving this to Sept but please let me know if you're not actively working on it for the next 2 weeks
Yes, we should be able to share tabular data about biosamples including an
env_package
column.When I convert the biosample metadata collected from DataHarmonizer, so that it can be inserted into MongoDB, the
env_package
is merged in. In this case, theenv_package
value comes from the study description pages. It can be reported out as a table with theenv_package
column during the conversion, or after being loaded into MongoDB.
If so, then the column can go away. Just need to be sure it's included to the template that's shared with EMSL.
See Kitware's submission retrieval API, https://data.dev.microbiomedata.org/api/metadata_submission?offset=0&limit=25
It captures the environmental package in the results.[].metadata_submission.packageName
path
It will be retained by the biosample instantiated and loaded into MongoDB
So, yes. I think we need to keep the column until we have a better way of submitting and parsing the metadata for each institution (EMSL, JGI). I'm currently doing it by hand. But once we have that we can remove the column.
With that, I think this is 'back log' until we have that process functional.
I don't understand. There is no need t capture the environmental package in the DH template.
If doing something by hand puts constraints on the workflow, let's figure out how we can automate your work.
Let's remove it, and then you can show me the negative consequences.
Don't remove it yet. Negative consequence is when someone submits samples to EMSL using the NMDC portal, I need that column. If we remove it, the only way I can access it is by going into the submission portal, finding that persons specific submission, and clicking through until I get to the "package" tab.
By hand is because there is no "submit" metadata and there is no automated way of converting from NMDC submission portal template to JGI or EMSL template formats. So I have to copy and shuffle columns manually.
Until we have the process of
We're a long way from having this functional.
Ok, I can kinda see that but would appreciate it if you could walk me though the process.
What we really need is for a display of the env package (or even the full template name) in the header of the templates
any comments, @mslarae13 ?