TrinityCTAT / ctat-mutations

Mutation detection using GATK4 best practices and latest RNA editing filters resources. Works with both Hg38 and Hg19
https://github.com/TrinityCTAT/ctat-mutations
Other
73 stars 18 forks source link

Interoperability between CTAT tools #100

Closed tjbencomo closed 3 years ago

tjbencomo commented 3 years ago

Hi,

I'm trying to profile fusions, mutations, and expression from RNA-Seq data using programs from the CTAT pipeline. For consistency I thought it would be optimal to have a single BAM for each sample that can be run through each program (STAR-Fusion, CTAT Mutation pipeline, RSEM or Salmon).

I see that ctat-mutations and STAR-Fusion by default both use FASTQs as input and output BAM files, although both pipelines can use BAM files as input. Does it make sense to run one pipeline first and then use the output BAM file for the other programs? For example, I could run STAR-Fusion using the FASTQ files and then use the resulting BAM as input to the mutation calling pipeline and RSEM. Is there an alternative ordering that would be better? My concern is that there are alignment parameters specific to each pipeline that might cause issues for the other pipelines.

brianjohnhaas commented 3 years ago

Hi,

It would be best to use the STAR-Fusion pipeline to generate the bam, and then use that bam with the ctat mutations pipeline. Just be sure to include the STAR-Fusion parameter: --STAR_SortedByCoordinate (which it doesn't do by default, but will be needed for variant calling).

best,

~brian

On Mon, Sep 27, 2021 at 12:32 PM Tomas Bencomo @.***> wrote:

Hi,

I'm trying to profile fusions, mutations, and expression from RNA-Seq data using programs from the CTAT pipeline. For consistency I thought it would be optimal to have a single BAM for each sample that can be run through each program (STAR-Fusion, CTAT Mutation pipeline, RSEM or Salmon).

I see that ctat-mutations and STAR-Fusion by default both use FASTQs as input and output BAM files, although both pipelines can use BAM files as input. Does it make sense to run one pipeline first and then use the output BAM file for the other programs? For example, I could run STAR-Fusion using the FASTQ files and then use the resulting BAM as input to the mutation calling pipeline and RSEM. Is there an alternative ordering that would be better? My concern is that there are alignment parameters specific to each pipeline that might cause issues for the other pipelines.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/NCIP/ctat-mutations/issues/100, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABZRKX5ERPRTMW6YUY5XSITUECMCPANCNFSM5E27CTTA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

--

Brian J. Haas The Broad Institute http://broadinstitute.org/~bhaas http://broad.mit.edu/~bhaas

tjbencomo commented 3 years ago

Thanks for the guidance on this!