WMBEdmands / compMS2Miner

metabolite identification R-package for metabolomic high-resolution LC-MS datasets.
14 stars 8 forks source link

compMS2Miner_logo

compMS2Miner is an R package for comprehensive and automatable annotation of metabolomic high-resolution LC-MS datasets.

DOI latest stable release v2.3.0 (archived on zenodo). bug fixed in matchSpectralDB InChI codes now in Comments fields of MoNA msp files. InChIKeys are now the unique identifiers however these are dead-ends for OpenBabel. While loop to catch any OpenBabel errors

Purpose

A long-standing challenge of untargeted metabolomic profiling by liquid-chromatography - high resolution mass spectrometry analysis (LC-HRMS) is rapid, precise and automatable transition from unknown mass spectral features in the form of a peak-picking software output table to full metabolite identification using MS2 fragmentation data.

The number of MS2 spectra commonly collected in a precursor selection based experiment (often thousands in a single MS2 datafile) limits the feasibility of painstaking manual interpretation of every spectrum. A degree of autonomous unknown annotation and at the very least a confident annotation of an unknowns most probable compound class is highly desirous. A holistic overview of the relationships between spectra can greatly facilitate the correct annotation of statistically relevant unknowns. When a handful of unknowns are targeted in isolation the broader context of an unknown can be easily missed and therefore putative identities poorly interpreted.

compMS2Miner is a package in the popular R programming language developed for comprehensive unknown feature annotation using peak-picker output files and MS2 data files as inputs (.mzML, .mzXML, .mgf). compMS2Miner brings together many useful metabolite identification tools (see Features section below) and is modular and every workflow method is therefore fully extensible.

Data curation, visualization and sharing is made possible at any stage of the compMS2Miner package workflow via an application Composite MS2 Explorer developed with the R shiny package. The application allows the user to rapidly create their own study-specific MS2 databases for each of their chromatographic methods. Additionally an msp database file can also be rapidly generated from the output of the compMS2Miner workflow.

If you find compMS2Miner useful for your metabolite annotation challenges please remember to cite us:

compMS2Miner: an automatable metabolite identification, visualization and data-sharing R package for high-resolution LC-MS datasets William Matthew Bell Edmands, Lauren M. Petrick, Dinesh Kumar Barupal, Augustin Scalbert, Mark Wilson, Jeffrey Wickliffe, and Stephen M Rappaport Analytical Chemistry Article ASAP DOI: 10.1021/acs.analchem.6b02394

Installation

N.B. Forewarned is forearmed. Users have reported difficulty installing the correct version of Java (which seems to be a common issue). Java is required for rcdk and it's dependency rJava however this package is only required for usage of the metID.rtPred function (specifically molecular descriptor calculation). If installation of Java is too problematic/frustrating then this function can be skipped in your compMS2Miner workflow.

Ensure that you install Java from the manual downloads page https://java.com/en/download/manual.jsp

Make sure to install the Offline version which matches your operating system and your computers processor if using Windows (i.e. 32 bit or 64 bit).

1. install the latest development version and all package dependencies in one-line of code directly from GitHub using the devtools package. First ensure devtools is installed, instructions can be found here: https://github.com/hadley/devtools

devtools::install_github('WMBEdmands/compMS2Miner', dependencies=c("Depends", "Imports", "Suggests"), build_vignettes=TRUE)

-Or-

2. Download the latest stable release tar /zip and install package locally. The devtools install_local function ensures all package dependencies are also installed.

# N.B. include full file path to your downloads directory
devtools::install_local('compMS2Miner_v2.2.6.tar.gz')

Getting started

After compMS2Miner is installed begin by reading the package vignette "compMS2Miner_Workflow" Just type vignette('compMS2Miner_Workflow') to view the pdf of the workflow. Or view an html version of the vignette on the web by clicking the image below:

compMS2Miner_logo

Example data illustrating compMS2Miner is provided internal to the package and consists of a peak-picker output table from a nano-flow LC-HRMS metabolomic dataset of human blood samples and corresponding data-dependent MS2 data files. An example workflow using this data is illustrated in the package vignette.

Features

The compMS2Miner structured workflow performs the following (v2.3.0, 2017/04/18):

An example Composite MS2 Explorer application created using the example data (within extdata of the package) is hosted on the shinyapps.io site here:

Upon completion of the compMS2Miner workflow the user can then load the Composite MS2 Explorer app from the compMS2 object directly or from a zip file (compMS2explorer()) and systematically examine each composite spectrum (including any automatic annotation made) using all of the available tools provided in the interface. Once a decision has been made on a putative annotation the user can then make potentially detailed comments in the interactive table. In this way metabolite identification decisions can be effectively and very efficiently recorded (such as links to journal articles and other pieces of evidence in support of an assignment).

As a final step following systematic evaluation of the data presented by the Composite MS2 Explorer app it is intended that the user publishes the application to the shinyapps.io site or as a self-contained zip file that can be easily viewed by others. Using the compMS2Miner function publishApp() the application can be publically deployed and explored by other investigators and all of the now read-only interactive table comments can be viewed. However, the user is still able to redeploy the application to the shinyapps.io site if any updates are necessary or just recreate the self-contained zip file. In theory, the published self-contained Composite MS2 Explorer app should be viewable in perpetuity.

This app publication approach could provide a feasible mechanism for transparency, generate experimental spectrum databases and a helpful way to share metabolite identification data alongside metabolomic/lipidomic publications.

Please give us your valuable feedback on anything you like/don't like and any suggestions for improvement or alternative methods you may find useful. We are always open to collaborations from fellow metabolomic investigators.

Licence

The compMS2Miner package is licenced under the GPLv3 (http://www.gnu.org/licenses/gpl.html).