ISA-tools / mzml2isa

Parser to get meta information from mzML file and parse relevant information to a ISA-Tab structure
GNU General Public License v3.0
12 stars 6 forks source link

mzml2isa

Extract metadata from mzML and imzML files and write them to an ISA-Tab document.

|Version| |Py versions| |Git| |Bioconda| |Build Status (Actions)| |Build Status (AppVeyor)| |License| |RTD doc| |DOI| |Paper|

Overview

mzml2isa is a Python3 program that can automatically generate ISA-Tab document structure metadata files from raw XML metabolomics data files (mzML open access data format). The mzml2isa tool provides the backbone of ISA-Tab metabolomics study which can then be edited with an ISA editing tool, ISAcreator (see MetaboLights pre-packaged ISA Creator <http://www.ebi.ac.uk/metabolights/>__)

Features:

Install

See the Installation page <http://2isa.readthedocs.io/en/latest/mzml2isa/install.html> of the online documentation <http://2isa.readthedocs.io/en/latest/mzml2isa/index.html>.

Usage

CLI '''

.. code:: bash

mzml2isa -i /path/to/mzml_files/ -o /path/to/out_folder/ -s name_of_study

Python Module '''''''''''''

See the Usage page <http://2isa.readthedocs.io/en/latest/mzml2isa/usage.html> and the Examples page <http://2isa.readthedocs.io/en/latest/mzml2isa/examples.html> for more information.

Metabolights

To download some real data from MetaboLights <http://www.ebi.ac.uk/metabolights/>__ studies to test the converter with, run

.. code:: bash

python scripts/metabolights-dl.py <size>

from inside the repository, where size is the maximum size in GiB you can allocate to download files. The script will download the files to the example_files/metabolight\ s folder and then run mzml2isa on those files..

If you use a *NIX machine with curlftpfs and bash available, you can also run

.. code:: bash

scripts/metabolights.sh

to mount the database to the example directory and start converting mzML studies.

Workflow

.. figure:: static/mzml2isa.png :alt: workflow

.. |Build Status (Actions)| image:: https://img.shields.io/github/workflow/status/ISA-Tools/mzml2isa/Test?label=GitHub%20Actions&maxAge=3600&style=flat :target: https://github.com/ISA-tools/mzml2isa

.. |Build Status (AppVeyor)| image:: https://img.shields.io/appveyor/ci/Tomnl/mzml2isa.svg?style=flat&maxAge=3600&label=AppVeyor :target: https://ci.appveyor.com/project/Tomnl/mzml2isa

.. |Py versions| image:: https://img.shields.io/pypi/pyversions/mzml2isa.svg?style=flat&maxAge=3600 :target: https://pypi.python.org/pypi/mzml2isa/

.. |Version| image:: https://img.shields.io/pypi/v/mzml2isa.svg?style=flat&maxAge=3600 :target: https://pypi.python.org/pypi/mzml2isa/

.. |Git| image:: https://img.shields.io/badge/repository-GitHub-blue.svg?style=flat&maxAge=3600 :target: https://github.com/ISA-tools/mzml2isa

.. |Bioconda| image:: https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg?style=flat&maxAge=3600 :target: http://bioconda.github.io/recipes/mzml2isa/README.html

.. |License| image:: https://img.shields.io/pypi/l/mzml2isa.svg?style=flat&maxAge=3600 :target: https://www.gnu.org/licenses/gpl-3.0.html

.. |RTD doc| image:: https://img.shields.io/readthedocs/2isa.svg?style=flat&maxAge=3600 :target: https://2isa.readthedocs.io/en/latest/mzml2isa/

.. |DOI| image:: https://zenodo.org/badge/37276823.svg :target: https://zenodo.org/badge/latestdoi/37276823

.. |Paper| image:: https://img.shields.io/badge/paper-Bioinformatics-teal.svg?style=flat&maxAge=3600 :target: https://academic.oup.com/bioinformatics/article/33/16/2598/3204983