malonge / RagTag

Tools for fast and flexible genome assembly scaffolding and improvement
MIT License
476 stars 50 forks source link
gap-filling genome-assembly scaffolding

logo_text

RagTag

DOI RELEASE CONDA GitHub

RagTag is a collection of software tools for scaffolding and improving modern genome assemblies. Tasks include:

RagTag also provides command line utilities for working with common genome assembly file formats.

logo_text

Getting Started

# install with conda
conda install -c bioconda ragtag

# correct a query assembly
ragtag.py correct ref.fasta query.fasta

# scaffold a query assembly
ragtag.py scaffold ref.fasta query.fasta

# scaffold with multiple references/maps
ragtag.py scaffold -o out_1 ref1.fasta query.fasta
ragtag.py scaffold -o out_2 ref2.fasta query.fasta
ragtag.py merge query.fasta out_*/*.agp other.map.agp

# use Hi-C to resolve conflicts
ragtag.py merge -b hic.bam query.fasta out_*/*.agp other.map.agp

# make joins and fill gaps in target.fa using sequences from query.fa
ragtag.py patch target.fa query.fa

Docs

Please see the Wiki for detailed documentation.

Dependencies

Citation

RagTag supersedes RaGOO:

Acknowledgments

Many of the major algorithmic improvements relative to RaGOO's first release were provided by Aleksey Zimin, lead developer of the MaSuRCA assembler. Luca Venturini suggested and initially implemented many feature enhancements, such as pysam integration. RagTag "merge" was inspired by CAMSA. The developer of CAMSA, Sergey Aganezov, helped review relevant RagTag code. RagTag "patch" was inspired by Grafter, a scaffolding tool written by Melanie Kirsche. Melanie provided guidance for the RagTag implementation. Michael Schatz has provided guidance for the whole project.