Closed isaacvock closed 6 months ago
Hi @isaacvock,
Thank you very much for your comment and your interest in factR2.
Yes, we are aware of this limitation of the .runAS feature and the current implementation is designed to pick up most common splicing types. We are constantly thinking of ways to improve this without compromising runtime, and we will try to detect more complex examples as you've described.
Hi @isaacvock , thank you for your patience with this. We have improved the splicing detection strategy of factR2 and it should now distinguish complex splicing events. Do give this a try and let us know if there are any problems.
Hello,
First off, thank you for this amazing tool! It has already proven incredibly useful in my research and is very well documented and easy to use.
The short version of my question: Would it be possible for factR2 to distinguish cassette exons that are only included in a single annotated isoform from those which are included in multiple isoforms and may have alternative 5' and 3' splice sites in some of these isoforms?
Longer version of my question: I dug into the source code for the
.runAS()
function called to assign splicing event type names to alternative exons, and was a bit surprised by how intronic regions are defined and used to annotate splicing events. To illustrate what I am referring to, I crafted a simple test example using a theoretical annotation for a single gene:Code to annotate splice events (basically just .runAS with "test" appended to end of object names):
which yields the final output of:
Of note is that the internal exons of both transcripts A and B are called "cassette exons" by factR2's strategy, since they are completely contained within a region that is intronic in at least one annotated isoform.
My expectation going into this exercise was that the splice annotation would note that the exon in transcript B is an alternative donor isoform of transcript A's internal exon. Would it be possible for future versions of factR2 to annotate the exon in transcript A as a cassette exon, but the exon in transcript B as an alternative donor event? Is there precedence in other tools that annotate splice events (e.g., alternative splice analysis tools like rMATS) to handle "cassette exons" that are included in only one isoform differently from those included in multiple isoforms?
I am interested in this functionality as I would like to identify instances where alternative splicing of the "cassette exon" influences properties of the two "cassette exon" containing isoforms (i.e., their half-lives).
Best, Isaac