rawgene / rawg

RNA-Seq Analysis Workflow Generator
GNU General Public License v3.0

RAWG: RNA-Seq Analysis Workflow Generator

Build Status Gitter Google group : RAWG Lightning Talk Poster

RAWG was born out of a student project at Imperial College London. The aim of this project is to provide an easy and intuitive way for researchers to conduct RNA-Seq analysis and compare different analysis pipelines, by automatically generating workflows based on user-selected tools.

Note: RAWG was previously hosted in a single repo under a personal account (with 700+ commits). The project was reorganised and transferred to this organisational account to better decouple the different components of RAWG. The original project, which is now archived, can be seen here.

At its core, RAWG is built with three components:

RAWG was presented as a lightning talk and poster at BOSC 2019 in Basel, Switzerland.


Please cite as:
Greco AP, Hedley-Miller P, Jesus F et al. RAWG: RNA-Seq analysis workflow generator [version 1; not peer reviewed]. F1000Research 2019, 8:1215 (poster) (https://doi.org/10.7490/f1000research.1117143.1)


Motivation

RNA sequencing (RNA-Seq) is becoming the gold standard for analysing gene expression in biological samples. Many analysis pipelines have been developed, enabling standardisation and automation of RNA-Seq data analysis. However, different methodologies lead to wide variation in the number of genes called significant, and different workflows exhibit a precision/recall trade-off. Therefore, the ultimate choice of workflow should take into consideration how the results will be used in subsequent applications (Williams et al.).

RAWG is our response to the ever-evolving landscape of RNA-Seq tools. We aim to provide a comprehensive and easy-to-use platform on which researchers are free to choose different tools based on their needs.

Current tools

We have included tools for differential gene expression (DGE), differential isoform expression (DIE) and differential exon expression. The complete set of tools and their possible connections are shown below.

tools
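As a rough illustration of what "possible connections" between tools can mean in practice, the sketch below models them as a small directed graph and checks whether a user-selected chain of tools is allowed. The tool names and edges in `NEXT_TOOLS`, as well as the function names, are assumptions made up for this example; they are not RAWG's actual tool catalogue or code.

```python
from typing import Dict, List

# Hypothetical compatibility map: each tool lists the tools that may follow it.
# These names and edges are illustrative only, not RAWG's actual catalogue.
NEXT_TOOLS: Dict[str, List[str]] = {
    "star": ["featurecounts", "stringtie"],
    "hisat2": ["featurecounts", "stringtie"],
    "featurecounts": ["deseq2"],
    "stringtie": ["deseq2"],
    "deseq2": [],
}


def is_valid_pipeline(tools: List[str]) -> bool:
    """Return True if every consecutive pair of tools is an allowed connection."""
    if not tools or any(tool not in NEXT_TOOLS for tool in tools):
        return False
    return all(nxt in NEXT_TOOLS[cur] for cur, nxt in zip(tools, tools[1:]))


def enumerate_pipelines(start: str) -> List[List[str]]:
    """List every path from `start` to a terminal tool (one with no successors)."""
    successors = NEXT_TOOLS[start]
    if not successors:
        return [[start]]
    return [
        [start] + rest
        for nxt in successors
        for rest in enumerate_pipelines(nxt)
    ]
```

For instance, `is_valid_pipeline(["star", "featurecounts", "deseq2"])` accepts the chain, while a reversed chain is rejected. Enumerating all paths from one aligner gives the set of pipelines a user could compare.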

RAWG architecture

A typical setup of an analysis workflow system using CWL is shown below. The parts covered by RAWG are highlighted in yellow. By design, RAWG's components are highly decoupled, which means different parts can be substituted for different purposes. This architecture also allows the CWL runner to be swapped, so that RAWG can support more platforms.

architecture
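To make the idea of generating a workflow from selected tools more concrete, here is a minimal sketch of how a linear CWL workflow description could be assembled and handed to an interchangeable runner. It is not RAWG's implementation. It assumes, hypothetically, that each tool has a wrapper file named `<tool>.cwl` with a single `File` input called `input` and a single `File` output called `output`, and that the workflow takes one input named `reads`. The function names are also invented for this example.

```python
from typing import Any, Dict, List


def build_workflow(tools: List[str]) -> Dict[str, Any]:
    """Chain the selected tools into a linear CWL Workflow description.

    Assumption (hypothetical): each tool has a wrapper `<tool>.cwl` exposing
    one File input named `input` and one File output named `output`.
    """
    if not tools:
        raise ValueError("at least one tool must be selected")
    steps: Dict[str, Any] = {}
    source = "reads"  # the workflow-level input feeds the first step
    for tool in tools:
        steps[tool] = {
            "run": f"{tool}.cwl",
            "in": {"input": source},
            "out": ["output"],
        }
        source = f"{tool}/output"  # the next step consumes this step's output
    return {
        "cwlVersion": "v1.0",
        "class": "Workflow",
        "inputs": {"reads": "File"},
        "outputs": {"result": {"type": "File", "outputSource": source}},
        "steps": steps,
    }


def runner_command(runner: str, workflow_path: str, job_path: str) -> List[str]:
    """Build the command line for a CWL runner; only the executable changes."""
    return [runner, workflow_path, job_path]
```

Because the generated description is plain data, it can be serialised with `json.dumps` (JSON is accepted as CWL input) and passed to whichever runner is available, for example `runner_command("cwltool", "workflow.cwl", "job.yml")` or the same call with `"toil-cwl-runner"`. Keeping generation separate from execution is what lets the runner be swapped without touching the rest.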

Current stage of development

RAWG is still in its infancy. We are actively looking for contributors and welcome any comments, suggestions and ideas. In particular, we are looking for