Closed matmu closed 2 years ago
Hello matmu,
m1 is expected number of differential genes. You can get it by experience (for example, cell line getting less than 1% diff genes and human tissue samples getting more than 1% diff genes), Or doing differential detection in your prior data.
For power changes with m1 changes, it is related to FDR. For example, if you want to control 1% FDR, if you have 10,000 genes in total and 1% differential genes and 80% power, you can identify 80 true differential genes, or if you have 10% differential genes and 80% power, you can identify 800 true differential genes. To get the same FDR (1%), you will need less false positive genes with 80 true differential genes than with 800 true differential genes. So you will need smaller alpha to control type 1 error (false positive genes), which means you have less power with 1% differential genes than 10% differential genes.
I am using the
est_power_distribution()
functionRnaSeqSampleSize
for estimating the power for different sample sizes for an RNAseq experiment. What is not clear to me is on what basis to chosem1
which is the "expected number of prognostic genes". Is it the number of significant genes I identified in my own dataset? For the example below, the power increases when I set a higherm1
, e.g. 3 or 4. Why is it like that?