LieberInstitute / RNAseq-pipeline

Original version of the RNA-seq pipeline implemented in SPEAQeasy at https://github.com/LieberInstitute/SPEAQeasy.
http://research.libd.org/SPEAQeasy/
5 stars 0 forks source link

add step to assess other RNA quality measures #17

Open andrewejaffe opened 7 years ago

andrewejaffe commented 7 years ago
andrewejaffe commented 7 years ago

note we can just take all annotated ribosomal genes and calculate the ratio of their counts to the total number of gene counts, instead of using RSeQC...i've done this in the past and it works well

andrewejaffe commented 7 years ago

In the gencode annotation, there is an rRNA category for gene_type - just add a line of code that adds an rRNA_rate to the alignment metric table, like colSums(geneCounts[which(geneMap$gene_type == "rRNA"),])/colSums(geneCounts))

emilyburke commented 7 years ago

Cool that rRNA_rate code worked, thanks. What do you mean in the first bullet point? "add cross-species alignment with 1M reads"

andrewejaffe commented 7 years ago

Joo Heon aligns a small number of reads (like 1M) to other species to assess contamination during the library construction process. Basically it would involve:

  1. subset 1M reads
  2. align those reads to mm10 and rn6
  3. retain hisat2 summary metrics

I am not completely convinced this adds much more usable information, as in the past, those samples with high cross-mapping tended to have higher chrM and rRNA mapping since those are homologous genes and thus relate to worse quality samples...

However, we will need to do something like this for the Stem Cell paper, as the neurons were a mix of human (neurons) and rat (glia) but I was going to have steve or badoi do it as a project-specific run

-a

On Thu, Jan 26, 2017 at 11:31 AM, Emily Burke notifications@github.com wrote:

Cool that rRNA_rate code worked, thanks. What do you mean in the first bullet point? "add cross-species alignment with 1M reads"

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/LieberInstitute/RNAseq-pipeline/issues/17#issuecomment-275436009, or mute the thread https://github.com/notifications/unsubscribe-auth/AAmMaDpC9fx5SpW9t5kAvbroi7YDArOzks5rWMpvgaJpZM4Lt0rj .