davidaknowles / leafcutter

Annotation-free quantification of RNA splicing. Yang I. Li, David A. Knowles, Jack Humphrey, Alvaro N. Barbeira, Scott P. Dickinson, Hae Kyung Im, Jonathan K. Pritchard
http://davidaknowles.github.io/leafcutter/
Apache License 2.0
207 stars 115 forks source link

No significant clusters found, but many differentially excised intron found with deltaPSI > 0.1 #131

Closed feiyang555 closed 4 years ago

feiyang555 commented 4 years ago

Dear All,

I have came across an Error: No significant clusters found when preparing the RData file to visualise the results, then I added the -f 0.15 flag to this step, I can visualise the data successfully. Please see the screenshot below. But when I look at the leafcutter_ds_effect_sizes.txt, there are indeed so many differential excised introns with deltaPSI > 0.1 or < -0.1. I am wondering why clusters including those introns didn't come out as significant clusters?

screenshot for shinyapp

Screen Shot 2020-03-22 at 9 15 45 pm

screenshot for leafcutter_ds_effect_sizes.txt

Screen Shot 2020-03-22 at 9 20 04 pm

Could you please let me know what happened to my results? please have a look at my results file and scripts

Screen Shot 2020-03-22 at 9 36 38 pm

leafcutter_ds_cluster_significance.txt leafcutter_ds_effect_sizes.txt groups_file.txt

Many thanks,

Fei

jackhump commented 4 years ago

Dear Fei,

I would point out that having a large deltaPSI does not always equate to a significant result. Looking at your effect sizes, only 67 out of 35,000 junctions have a dPSI > 0.1. My hunch would be that the two groups are very heterogeneous and that may be masking any true effect.

However it is puzzling that you do not see any significant differences between your groups. Do you have reason to believe that there would be splicing differences between the two groups? Do the two groups separate when you do a PCA of the junction counts or on the PCA of the junction ratios provided in the Shiny app?

Could there be technical or other confounding factors that you are not accounting for? You can add covariates as extra columns in the groups file you use in the differential splicing step. I would imagine you have a library preparation batches, RNA quality metrics like RIN. coverage metrics like % of reads mapping to mRNA bases etc that may be confounding your signal.

feiyang555 commented 4 years ago

Dear Jack,

I checked my PCA plots in shiny app, case and control didn't separate. Please see the picture below. Now I am trying to include more covariates including (batch effects, library pool, age, etc) and will update to you very soon.

On the other hand, could you please have a look at another issue # How to finalising the number of sQTLs https://github.com/davidaknowles/leafcutter/issues/133? I am a little bit confused about correction step, cause not sure how to deal with the fact that some introns of a cluster have same best SNP and different clusters may also have the same best SNP. I read the leafcutter paper, still don't understand what you mean by using Bonferroni correction to control for the number of introns tested per cluster. Do you mean I need to write a specific R loop script to run Bonferroni correction for each cluster? In this case, how to the different cluster may also have the best same SNP?

Many thanks,

Fei

Screen Shot 2020-04-10 at 10 33 45 am
jackhump commented 4 years ago

The variance explained is also very low for the first 2 PCs, which is odd. I'm suspicious about your data. Have you run PCA on normalised gene expression values? Do you see a similar lack of clustering?

feiyang555 commented 4 years ago

Dear Jack,

Thanks very much for your reminding. I I am looking for this part of results now cause I didn't do this part and will let you know later. From what I know, there is no significantly differentially expressed genes after Benjamini-Hochberg correction between two groups. If the analysis process is correct, I guess another potential reason might be that there is no too much differences between two groups, right?

Best regards,

Fei

jackhump commented 4 years ago

I see. Yes, I would be surprised to find differential splicing between two groups that didn't have any differential expressoin.