ndovro / ZWA

GNU General Public License v3.0
0 stars 0 forks source link

ZWA

The Zero Waste Algorithm (ZWA) is a pipeline comprised of various bioinformatic tools for context-based read trimming. The goal of this pipeline is to assist in viral genome assembly by retaining the viral part of reads and discarding any ribosomal hybrid bases the reads might contain.

DEPENDECIES:

Before running

Make sure you edit the script to specify paths and system related variables

Typical run

./zwa.sh RAW_READS.fastq reference.fasta

!!!LIMITATIONS!!!

1)This is an unoptimized proof of concept pipeline. It is quite slow

2)If your raw reads contain the letter 'S' in the contig name you should replace it before running the pipeline (eg. use sed)

3)This script has been tested on single-end raw reads only. Nothing suggests that it will not work for paired-end reads after pairing them.

For ease of use and to maximize performance, a Docker image with all the required software preinstalled has been created at https://hub.docker.com/repository/docker/ndovroli/zwa/. It is ready to deploy on any system/cloud with Docker.