SinOncology / BreakID

BreakID: genomics breakpoints identification to detect gene fusion events using discordant pairs and split read
GNU General Public License v3.0
13 stars 6 forks source link

BreakID

BreakID is an genomic breakpoint prediction method that can discover and genotype translocations, tandem duplications, inversions and translocations at single-nucleotide resolution in short-read massively parallel sequencing data. It uses discordant read pairs and split-reads to sensitively and accurately delineate genomic rearrangements.

Prerequisits for BreakID

nib files you can prepare them by the following steps:

mkdir nib_files

cd nib_files

rsync -avzP rsync://hgdownload.cse.ucsc.edu/goldenPath/hg19/chromosomes/ .

gunzip *.gz

wget https://users.soe.ucsc.edu/~kent/src/blatSrc35.zip

unzip blatSrc35.zip

Installing BreakID

The easiest way to get BreakID is to download a statically linked binary from the BreakID github release page. You can also build BreakID from source.

git clone https://github.com/SinOncology/BreakID.git

cd BreakID/

sh install.sh

Running BreakID

BreakID needs a sorted, indexed and duplicate marked bam file for every input sample. An indexed reference genome is required to identify split-reads. The output is in plain txt format.

bin/BreakID -i input_bam -o out_prefix -n /path/to/nib/directory

FAQ

Citation

Linfang Jin, Jinhuo Lai, Yang Zhang, Ying Fu, Shuhang Wang, Heng Dai, Bingding Huang. BreakID: genomics breakpoints identification to detect gene fusion events using discordant pairs and split read.
Bioinformatics, 2018 accepted.

License

BreakID is distributed under the GPLv3. Consult the accompanying LICENSE file for more details.