Open andrewejaffe opened 7 years ago
note we can just take all annotated ribosomal genes and calculate the ratio of their counts to the total number of gene counts, instead of using RSeQC...i've done this in the past and it works well
In the gencode annotation, there is an rRNA
category for gene_type
- just add a line of code that adds an rRNA_rate
to the alignment metric table, like colSums(geneCounts[which(geneMap$gene_type == "rRNA"),])/colSums(geneCounts))
Cool that rRNA_rate code worked, thanks. What do you mean in the first bullet point? "add cross-species alignment with 1M reads"
Joo Heon aligns a small number of reads (like 1M) to other species to assess contamination during the library construction process. Basically it would involve:
I am not completely convinced this adds much more usable information, as in the past, those samples with high cross-mapping tended to have higher chrM and rRNA mapping since those are homologous genes and thus relate to worse quality samples...
However, we will need to do something like this for the Stem Cell paper, as the neurons were a mix of human (neurons) and rat (glia) but I was going to have steve or badoi do it as a project-specific run
-a
On Thu, Jan 26, 2017 at 11:31 AM, Emily Burke notifications@github.com wrote:
Cool that rRNA_rate code worked, thanks. What do you mean in the first bullet point? "add cross-species alignment with 1M reads"
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/LieberInstitute/RNAseq-pipeline/issues/17#issuecomment-275436009, or mute the thread https://github.com/notifications/unsubscribe-auth/AAmMaDpC9fx5SpW9t5kAvbroi7YDArOzks5rWMpvgaJpZM4Lt0rj .