greenelab / pancancer-evaluation

Evaluating genome-wide prediction of driver mutations using pan-cancer data
BSD 3-Clause "New" or "Revised" License

Add statistical testing to compare single-cancer and pan-cancer results #15

Open jjc2718 opened 3 years ago

jjc2718 commented 3 years ago

A t-test with FDR correction would probably work fine as a first pass at this.

We should also compare the signal results against the shuffled predictions.
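A rough sketch of what that first pass could look like (not the repo's actual code): run an unpaired t-test per gene on the per-fold metrics, then apply a Benjamini-Hochberg FDR adjustment across genes. The function names and the `gene -> list of per-fold metrics` layout are assumptions made for illustration; the only library call relied on is `scipy.stats.ttest_ind`.

```python
import numpy as np
from scipy import stats


def benjamini_hochberg(pvals):
    """Return Benjamini-Hochberg adjusted p-values, in the original order."""
    pvals = np.asarray(pvals, dtype=float)
    n = pvals.size
    order = np.argsort(pvals)
    # Scale each sorted p-value by n / rank.
    scaled = pvals[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working down from the largest p-value.
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    adjusted = np.empty(n)
    adjusted[order] = np.clip(scaled, 0.0, 1.0)
    return adjusted


def compare_conditions(scores_a, scores_b):
    """Unpaired t-test per gene, followed by FDR correction across genes.

    scores_a, scores_b: dict mapping gene -> list of per-fold metrics
    (hypothetical layout, e.g. single-cancer vs. pan-cancer results).
    Returns a dict mapping gene -> adjusted p-value.
    """
    genes = sorted(set(scores_a) & set(scores_b))
    pvals = [stats.ttest_ind(scores_a[g], scores_b[g]).pvalue for g in genes]
    return dict(zip(genes, benjamini_hochberg(pvals)))
```

The same function could be used for the signal vs. shuffled comparison by passing the shuffled metrics as `scores_b`.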

jjc2718 commented 3 years ago

This was partially addressed in #20, but we're currently using an unpaired t-test. A paired t-test would likely have slightly better power, but before we can use it we need to verify that the cross-validation folds being compared are actually the same between runs.
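One possible way to guard the paired test, sketched under the assumption that each run records the held-out sample IDs for every fold (the `(test_sample_ids, metric)` layout and the function name are hypothetical, not something the repo is known to provide): check that the folds line up, and only then call `scipy.stats.ttest_rel`.

```python
from scipy import stats


def paired_test_if_folds_match(folds_a, folds_b):
    """Run a paired t-test only if both runs used identical CV folds.

    folds_a, folds_b: lists of (test_sample_ids, metric) tuples, one per
    fold, in fold order (hypothetical layout for illustration).
    """
    if len(folds_a) != len(folds_b):
        raise ValueError("runs have different numbers of folds")
    for i, ((ids_a, _), (ids_b, _)) in enumerate(zip(folds_a, folds_b)):
        # Pairing is only meaningful if fold i holds out the same samples.
        if set(ids_a) != set(ids_b):
            raise ValueError(f"fold {i} has different held-out samples")
    metrics_a = [metric for _, metric in folds_a]
    metrics_b = [metric for _, metric in folds_b]
    return stats.ttest_rel(metrics_a, metrics_b)
```

If the check fails, falling back to the unpaired test is the conservative option.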