Caleydo / caleydo

Caleydo - Visualization for Molecular Biology
http://caleydo.org
BSD 3-Clause "New" or "Revised" License

Statistical test: Wilcoxon rank sum test #1860

Closed wassermai closed 9 years ago

wassermai commented 10 years ago

Implement the Wilcoxon rank sum test in addition to Fisher's exact test. Test explanation: http://en.wikipedia.org/wiki/Mann%E2%80%93Whitney_U_test

Fisher's exact test is better suited to categorical data (mutation status), whereas the Wilcoxon rank sum test is better suited to continuous data (expression levels, copy numbers).

cpartl commented 10 years ago

I do have 2 basic questions regarding this test:

wassermai commented 10 years ago

The test would be applied to two groups of cell lines: those that are sensitive and those that are insensitive to a drug. The cell lines can be partitioned into these two groups by setting an AC50 threshold of 8 µM for the CCLE data set. For each group, you then get a distribution of either copy numbers or expression levels of a selected gene. No normalization is needed, because gene expression is compared to gene expression and copy numbers are compared to copy numbers. The test then detects differences between the two distributions and can tell us whether the sensitive cell lines have significantly higher or lower expression of the selected gene or, when applied to copy numbers, significantly more or fewer copies of that gene.
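
To make the procedure concrete, here is a rough sketch in Python. It is not Caleydo code, only an illustration of the idea. It assumes that cell lines with an AC50 below the threshold count as sensitive (the direction is my assumption), and it uses the normal approximation with tie correction rather than exact p-values, so it is only reasonable for groups that are not tiny. The names `split_by_ac50`, `average_ranks`, and `rank_sum_test` are made up for this example.

```python
import math


def split_by_ac50(ac50_by_line, value_by_line, threshold=8.0):
    """Partition per-cell-line values into (sensitive, insensitive) lists.

    Assumption: AC50 below the threshold (in uM) means "sensitive".
    """
    sensitive, insensitive = [], []
    for line, ac50 in ac50_by_line.items():
        if line not in value_by_line:
            continue  # no expression / copy number value for this cell line
        target = sensitive if ac50 < threshold else insensitive
        target.append(value_by_line[line])
    return sensitive, insensitive


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank sum (Mann-Whitney U) test.

    Returns (U statistic of x, p-value) using the normal approximation
    with tie correction and no continuity correction.
    """
    n1, n2 = len(x), len(y)
    if n1 == 0 or n2 == 0:
        raise ValueError("both groups must be non-empty")
    n = n1 + n2
    combined = list(x) + list(y)
    ranks = average_ranks(combined)
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    # Tie correction: sum of (t^3 - t) over groups of equal values.
    counts = {}
    for v in combined:
        counts[v] = counts.get(v, 0) + 1
    tie_term = sum(t ** 3 - t for t in counts.values())
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1.0)))
    if var_u <= 0:
        return u1, 1.0  # all values identical: no evidence of a difference
    z = (u1 - mean_u) / math.sqrt(var_u)
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    return u1, p_two_sided
```

Usage would be something like `sens, insens = split_by_ac50(ac50, expression)` followed by `u, p = rank_sum_test(sens, insens)`. A U value below `len(sens) * len(insens) / 2` points towards lower values in the sensitive group, a U value above it towards higher values.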

cpartl commented 10 years ago

Ah, this makes much more sense now :-)