hammerlab / biokepi

Bioinformatics Ketrew Pipelines
Apache License 2.0
27 stars 4 forks source link

Add support for FusionCatcher #368

Open iskandr opened 7 years ago

iskandr commented 7 years ago

Identifies fusion proteins from RNAseq and also catches expression of viral sequences:

https://github.com/ndaniel/fusioncatcher

hammer commented 7 years ago

May also want to consider https://sites.google.com/site/bioericscript as it's what UCSC's ProTECT uses for fusion calling in their neoepitope selection pipeline.

arahuja commented 7 years ago

I actually did this at some point: https://github.com/hammerlab/biokepi/commit/4f3449d4f7bd06bde609b7a759daf6d59e883ad6

I'm 80% certain this worked, but will re-test and submit a PR. The issue was that on many, many BAMs this required > 60 GB memory and would fail, which was really unfortunate.

hammer commented 7 years ago

Grab a highmem type! https://cloud.google.com/compute/docs/machine-types

arahuja commented 7 years ago

Yea that is actually what spurred my move to gcloud originally, got a 128G machine to run all of our samples through.