microbiomedata / issues

public repo for issues related to NMDC work
1 stars 0 forks source link

Milestone - Parameterized reference selection for NOM and metabolomics (2.18) #506

Open ssarrafan opened 11 months ago

ssarrafan commented 11 months ago

Informed metabolomics and NOM search space Due to the metabolome's highly complex and still largely undiscovered molecular composition, de novo analysis strategies, often used in proteomics and genomics data workflows, are not feasible for mass spectrometry-based metabolomics data annotation. Instead, metabolomics data workflows rely on comparison of experimental data to analytical data collected on molecular standard compounds or to predictive models of analytical features, such as molecular fragmentation patterns, fine isotopic structures, and chromatographic retention time. Therefore, molecular reference databases, both molecular standards-based and in-silico, are fundamental components that support metabolite identification. However, selecting the molecular search space for both NOM and metabolomics is a manual, empirical process that relies on knowledge of the sample and experiment parameters (e.g., sample extraction solvent, chromatography separation technique, etc.). Therefore, we will leverage NMDC’s sample metadata and the addition of detailed sample preparation metadata to our schema (see Development of metabolomics and metaproteomics standards), to automate the parameterization and data reference selection for the NOM and metabolomics workflows (Milestone 2.18). This effort will increase the accuracy of metabolite annotation and greatly reduce false positive assignments.

Page 33

ssarrafan commented 1 month ago

Scheduled for 2025Q4. Moving this issue. FYI @lamccue @emileyfadrosh

aclum commented 1 month ago

Appears to duplicate #505?

ssarrafan commented 1 month ago

Appears to duplicate #505?

Thanks @aclum This was one of the repeating milestones. I just closed 505. I'll keep this one open.