frattalab / PAPA

PAPA (Pipeline-Alternative Polyadenylation) - Snakemake pipeline for analysis of APA from short-read RNA-seq data
GNU General Public License v3.0
1 stars 0 forks source link

filter_tx_by_intron_chain.py - add minimum length filters for extension events #19

Closed SamBryce-Smith closed 2 years ago

SamBryce-Smith commented 2 years ago

Due to imprecision of annotation / predicted assembly, small differences of 1-2nt will be called an extension event despite likely being reassembly of the annotated isoform.

Also, due to last exons having shorter/longer isoforms, a transcript can be called as an extension of the shorter isoform, when in reality it is just reassembly of the longer isoform. Simple way around this is to take the smallest extension length for each isoform