lch14forever / shotgunmetagenomics-nf

MIT License

Shotgun Metagenomics Pipeline

Description

This is a Nextflow re-implementation of the original pipeline used by Computational and Systems Biology Group 5 (CSB5) at the Genome Institute of Singapore (GIS).

Documentation in Chinese

Development plan

Features

Dependencies

Main pipeline

Quality control and host DNA decontamination

Reference based analysis

Setup and configuration

Usage

Run with Docker

$ shotgunmetagenomics-nf/main.nf -profile docker --read_path PATH_TO_READS

Run on AWS Batch (see the AWS Batch configuration tutorial)

$ shotgunmetagenomics-nf/main.nf -profile awsbatch --awsqueue AWSBATCH_QUEUE --awsregion AWS_REGION --bucket-dir S3_BUCKET --outdir S3_BUCKET 

You can specify multiple profiles separated by commas, e.g. -profile docker,test.

Run multiple profilers

$ shotgunmetagenomics-nf/main.nf -profile gis --profilers kraken2,metaphlan2 --read_path PATH_TO_READS

Use cases

Adding a module

  1. Write the module and put it into modules/
  2. Add it to the main script main.nf
  3. Modify the configuration file conf/base.config to declare the resources the module requires (GIS users should also modify conf/gis.config to set the module's conda environment)
  4. Add conda and Docker files for the new module
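
As a rough illustration of step 3, the fragment below sketches what the configuration entries for a new module might look like. It is not taken from this repository: the process name EXAMPLE_TOOL, the resource values, and the conda environment path are all hypothetical placeholders. It also assumes that the existing config files assign resources per process with Nextflow's withName selector, so check conf/base.config and conf/gis.config for the convention actually in use before copying it.

```groovy
// Sketch for conf/base.config: resources for a hypothetical new process.
// EXAMPLE_TOOL and the values below are placeholders, not part of this pipeline.
process {
    withName: 'EXAMPLE_TOOL' {
        cpus   = 4
        memory = 8.GB
        time   = 2.h
    }
}

// Sketch for conf/gis.config: point the same hypothetical process at a conda
// environment. The path is a placeholder for an environment you have created.
process {
    withName: 'EXAMPLE_TOOL' {
        conda = '/path/to/conda/envs/example_tool'
    }
}
```

The two process blocks belong in their respective files. The selector name must match the process name defined in the module file under modules/, otherwise Nextflow will not apply the settings to it.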

Contact

Chenhao Li: lich@gis.a-star.edu.sg, lichenhao.sg@gmail.com