Open olgabot opened 10 years ago
Hi Olga, Thanks for this nice test case - I just reproduced it. It's most certainly a bug. I'm looking into it now.
Any news on this?
Hi Olga,
It appears that the read length is 92 and your short isoform is 90, less than the read length, and this is what caused it.
Thanks for digging into this! The exon lengths are: 57, 78, 33. So is the "short isoform" exon1+exon3 (57+33 = 90)?
If the read length is longer than the isoform length, why would the event get thrown out? There's perfect coverage of the event.
I agree it's not intuitive, but here's one explanation. Since your read length is 92 and the shortest isoform is 90, it's not possible -- unless you consider non-end-to-end alignments, which MISO doesn't -- to have exclusion reads. That is, you by definition can't observe skipping of the exon in this setup (e.g. in single end reads). So based on the data, the Psi could be 100% because there's no skipping, or it could just be that you can't observe the exclusion isoform. This edge case will have to be dealt with in a more intuitive way in the next version. If you have thoughts on best behavior, let me know. --Yarden
True, but that ignores that there's possible exons upstream and downstream of this particular event. Can future versions just look at the parts of the read that overlap with the event? What problems could this create? I suppose if this was an AFE or ALE event and the read overlapped the beginning/end of the first exon, that would be inconsistent with the annotation. But if the read was soft-clipped, would this matter?
The exon-centric annotations are intended to be only of an alternative trio of interest, so if you use that kind of annotation, that's the only thing MISO knows about -- looking at exons upstream and downstream of the event would require knowledge about which genome to look at, which reference annotation, etc. You get the same effect by making an annotation that includes more of the flanking upstream and downstream exons of the event. If your reads are length X, you could include enough flanking exons such that no isoform is shorter than X (ideally not shorter than your average insert length either). That would require deciding which exons should be included (since they could be alternative too, etc.), so to keep things generic it's up to the annotation to determines what gets considered.
Getting a similar bug again, this time on RI events. The event is in MYL6, 'chr12:56553371-56553406:+@chr12:56553759-56553932:+'
is called as 0.5 in these samples, even though I don't see any intron retention, and think the values here should be "0."
Here's one of the bam files, with just reads from the MYL6 locus.
I don't think this is the same as the short isoform bug, because isoforms A and B are 562 and 210, respectively, and my reads are 100bp.
This was using misopy v0.5.2
Hello there, I was getting Psi scores of 0.5 for events that are definitely 100% included and Psi should be 1. Any idea why this might be happening?
Here's the event:
Here's an example *.miso file:
Here's the bam file
Here's the command:
And here's the settings file: