YuLab-SMU / clusterProfiler

:bar_chart: A universal enrichment tool for interpreting omics data
https://yulab-smu.top/biomedical-knowledge-mining-book/
1.02k stars 256 forks source link

Differences to topGO enrichment #511

Open chrarnold opened 2 years ago

chrarnold commented 2 years ago

I am having some questions about the enrichGO function, and its comparability with the results obtained from the topGO package. I'd appreciate if someone can comment on my questions, I think this is of general help:

  1. Are there any methodological differences between the two packages? Which type of algorithm / test combination clusterProfiler'' corresponds best as compared totopGOwhen using it in the default mode with the Fisher statistic and theweight01` algorithm?
  2. p-value adjustment: enrichGO returns p.adjust and qvalue, while topGO returns only raw p-values that, however, are claimed to "not be affected by the multiple testing problem because they are not independent due to the nestedness and correlation of neighboring GO terms". Details can be found here (https://datacatz.wordpress.com/2018/01/19/gene-set-enrichment-analysis-with-topgo-part-1/). The authors also note in the paper (https://academic.oup.com/bioinformatics/article/22/13/1600/193669). What is your take here, especially because you report adjusted p and q values?

Thank you!

chrarnold commented 2 years ago

When I look at a correlation plot of topGO p-values and clusterProfiler p-values (unadjusted), I see a wild mixture: