Functional-Metabolomics-Lab / FBMN-STATS

FBMN-STATS: A hitchhiker's guide to statistical analysis of Feature-based Molecular Networks
BSD 3-Clause "New" or "Revised" License
30 stars 25 forks source link

Google-Colab Files Upload

This repository contains the test data and the Jupyter notebooks for the paper The Hitchhiker’s Guide to Statistical Analysis of Feature-based Molecular Networks from Non-Targeted Metabolomics Data.
Using the notebooks provided here, one can perform data merging, data cleanup, blank removal, batch correction, and univariate and multivariate statistical analyses on their non-targeted LC-MS/MS data and Feature-based Molecular Networks.

The result files of the notebook can be found in the Google Drive:

MASSIVE Datasets from which all the files were selected for MZmine3 : MSV000082312 and MSV000085786

To easily install and run Jupyter Notebook in R, follow the steps in the document according to your preferred OS

Running the Notebooks (R, Python) on the cloud using Google Colab:

This Notebook can be also executed using Google Colab, a cloud environment for running Jupyter Notebooks. It is commonly used with Python and comes pre-installed with all essential Python packages. However, we can also run Colab with R Notebooks. Basic requirement for using Colab is to have a google account. No extra installation in your computer is needed as such for Jupyter Notebook.

Disclaimer:

For Code Copying:

For GNPS Quickstart Users:

For Google Colab Users:

For QIIME2 Users:

Follow the information on the README file in the QIIME2 folder and for more information, refer to the SI document of the article. Note: The Notebook provided here cannot be accessed using Google Colab.

For first time Colab users, some useful information to note:

1. Package Installation:

Since Colab does not come pre-installed with R packages (or libraries) when running our R Notebook in Colab, we need to install the packages every time we run the notebook and the installation might take some time. However, direct Jupyter Notebook users need to install it only once as it is installed locally.

2. Setting a working directory and loading input files:

Google-Colab Files Upload

utils::zip(zipfile = 'TestData_Results', files = "/My_TestData/")

4. Limitations of Google Colab:

Although Colab is easier to use and is all Cloud-based, the main problem with the Colab environment is when you leave the Colab notebook idle for 90 mins or continuously used it for 12 hours, the runtime will automatically disconnect. This means you will lose all your variables, installed packages, and files. Hence, you need to rerun the entire notebook. Another limitation is disk space of 77 GB for the user. Please be aware of this while working with larger datasets and running longer notebooks.