Oshlack / JAFFA

JAFFA is a multi-step pipeline that takes either raw RNA-Seq reads, or pre-assembled transcripts, then searches for gene fusions
https://github.com/Oshlack/JAFFA/wiki
Other
87 stars 21 forks source link

pacbio pre-processing #73

Closed gr8ape closed 2 years ago

gr8ape commented 2 years ago

Hi, Do you recommend using the pacbio reads that have been polished with CCS or is their benefit to using the raw subreads.bam converted to fastq as input?

nadiadavidson commented 2 years ago

Hi, Great question. If it's easy to try both, I would do that on a few samples to see which gives the better sensitivity. In theory the raw data should be fine and JAFFAL can tolerate quite a high error rate. However, if each subread is treated as an individual read this might inflate the number of reads supporting fusions and you'll likely get a lot of false positives. Results from CCS should be safer, but it's not clear to me how the sensitive would compare to the raw reads. Very happy for you to report back if you test this out!

Cheers, Nadia.

gr8ape commented 2 years ago

@nadiadavidson thanks for the prompt reply. I'll post back here once I've done some additional R&D and let you know what we found