Closed feiyang555 closed 4 years ago
Dear Fei,
I would point out that having a large deltaPSI does not always equate to a significant result. Looking at your effect sizes, only 67 out of 35,000 junctions have a dPSI > 0.1. My hunch would be that the two groups are very heterogeneous and that may be masking any true effect.
However it is puzzling that you do not see any significant differences between your groups. Do you have reason to believe that there would be splicing differences between the two groups? Do the two groups separate when you do a PCA of the junction counts or on the PCA of the junction ratios provided in the Shiny app?
Could there be technical or other confounding factors that you are not accounting for? You can add covariates as extra columns in the groups file you use in the differential splicing step. I would imagine you have a library preparation batches, RNA quality metrics like RIN. coverage metrics like % of reads mapping to mRNA bases etc that may be confounding your signal.
Dear Jack,
I checked my PCA plots in shiny app, case and control didn't separate. Please see the picture below. Now I am trying to include more covariates including (batch effects, library pool, age, etc) and will update to you very soon.
On the other hand, could you please have a look at another issue # How to finalising the number of sQTLs https://github.com/davidaknowles/leafcutter/issues/133? I am a little bit confused about correction step, cause not sure how to deal with the fact that some introns of a cluster have same best SNP and different clusters may also have the same best SNP. I read the leafcutter paper, still don't understand what you mean by using Bonferroni correction to control for the number of introns tested per cluster. Do you mean I need to write a specific R loop script to run Bonferroni correction for each cluster? In this case, how to the different cluster may also have the best same SNP?
Many thanks,
Fei
The variance explained is also very low for the first 2 PCs, which is odd. I'm suspicious about your data. Have you run PCA on normalised gene expression values? Do you see a similar lack of clustering?
Dear Jack,
Thanks very much for your reminding. I I am looking for this part of results now cause I didn't do this part and will let you know later. From what I know, there is no significantly differentially expressed genes after Benjamini-Hochberg correction between two groups. If the analysis process is correct, I guess another potential reason might be that there is no too much differences between two groups, right?
Best regards,
Fei
I see. Yes, I would be surprised to find differential splicing between two groups that didn't have any differential expressoin.
Dear All,
I have came across an Error: No significant clusters found when preparing the RData file to visualise the results, then I added the -f 0.15 flag to this step, I can visualise the data successfully. Please see the screenshot below. But when I look at the leafcutter_ds_effect_sizes.txt, there are indeed so many differential excised introns with deltaPSI > 0.1 or < -0.1. I am wondering why clusters including those introns didn't come out as significant clusters?
screenshot for shinyapp
screenshot for leafcutter_ds_effect_sizes.txt
Could you please let me know what happened to my results? please have a look at my results file and scripts
leafcutter_ds_cluster_significance.txt leafcutter_ds_effect_sizes.txt groups_file.txt
Many thanks,
Fei