mhammell-laboratory / TEtranscripts

A package for including transposable elements in differential enrichment analysis of sequencing datasets.
http://hammelllab.labsites.cshl.edu/software/#TEtranscripts
GNU General Public License v3.0
206 stars 29 forks source link

Counting TEs in introns #81

Closed mlizio closed 1 year ago

mlizio commented 3 years ago

Hi, I have RNA-seq data from nuclear fractions. I quantify my gene expression by considering the whole gene, so if I want to use TEtranscripts with my gene annotation, what happens to TEs falling into introns? Will they contribute to the expression of both genes and TEs?

Would it be better to skip TEs in introns?

Thank you

olivertam commented 3 years ago

Hi,

Thank you for your interest in the software. Working with nuclear fractions is certainly a difficult problem, and one that we are still wrapping our head around. In its current form, it is not optimized for nuclear fractions precisely because of the presence of TE in introns. In order to capture intronic reads and assign them correctly, you will need to have a GTF where the entire transcript is considered an exon (similar to the approach that Cell Ranger does for single cell analysis, see bottom of this page).

However, the way that TEtranscripts treats intronic TE is that it will assign unambiguously mapped reads to the gene (intron), but ambiguously mapped reads to the TE. Thus, there's a possibility that a highly expressed gene with an intronic TE that is also present elsewhere to have reads assigned to the TE instead.

You can certainly remove intronic TE from your annotations. This might lead to an underestimation of TE reads (and overestimation of a gene read), but might be the best compromise.

This is still an active area of development, and hopefully we will have something that could better handle RNA-seq from nuclear fractions soon.

Thanks.

github-actions[bot] commented 1 year ago

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days

github-actions[bot] commented 1 year ago

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days