Gaius-Augustus / GUSHR

Generating UTRs from SHort Reads
GNU General Public License v3.0
9 stars 2 forks source link

GUSHR

Assembly-free construction of UTRs from short read RNA-Seq data on the basis of coding sequence annotation.

This tool has been adapted to the format needs of AUGUSTUS/BRAKER and employs GeMoMa for generating UTRs from RNA-Seq coverage data.

Contacts for Github Repository of GUSHR:

Katharina J. Hoff, University of Greifswald, Germany, katharina.hoff@uni-greifswald.de, +49 3834 420 4624

Authors of GUSHR:

Katharina J. Hoffa, b, Mario Stanke a, b and Jens Keilwagenc

[a] University of Greifswald, Institute for Mathematics and Computer Science, Walther-Rathenau-Str. 47, 17489 Greifswald, Germany

[b] University of Greifswald, Center for Functional Genomics of Microbes, Felix-Hausdorff-Str. 8, 17489 Greifswald, Germany

[c] Julius Kühn-Institut, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany

Software Requirements

GUSHR is a script written in Python3 that runs a number of external tools, some of which are Bash core utils. Thus, GUSHR should be executed on a Linux system (we developed and tested code on Ubuntu 20.04.1 LTS).

Required Bash core utils:

Other software dependencies:

GUSHR runs a GeMoMa.jar file that is supplied with GUSHR. This GeMoMa.jar file was originally created by Jens Keilwagen et al. with Java 1.8 (http://www.jstacs.de/index.php/GeMoMa#Download, https://github.com/Jstacs/Jstacs) and is here re-distributed. You have the option to specify a different (e.g. more recent) GeMoMa.jar file when calling GUSHR but compatibility is then not guaranteed.

Test GUSHR

First, you need to retrieve an RNA-Seq example file that is not included on github due to its size:

wget http://bioinf.uni-greifswald.de/bioinf/braker/RNAseq.bam

(or: cd example; ./download_rnaseq.sh).

After that, and if all software requirements are satisfied, you can test GUSHR:

./gushr.py -t example/augustus.gtf -b example/RNAseq.bam -g example/genome.fa -o utrs

What to Cite

If you used GUSHR in your work, please cite the following sources: