AESPCA function - Add Non-Parametric p-Value Option

gabrielodom / pathwayPCA

integrative pathway analysis with modern PCA methodology and gene selection

https://gabrielodom.github.io/pathwayPCA/

11 stars 2 forks source link

AESPCA function - Add Non-Parametric p-Value Option #50

Closed gabrielodom closed 5 years ago

gabrielodom commented 5 years ago

Per the PGDAC meeting with Bing's group, Steven mentioned that we should compare the raw (parametric) $p$-values to the permutation $p$-values to see if there is a clear relationship. Thus we need an option to AESPCA_pVals() to request the raw $p$-value.

gabrielodom commented 5 years ago

See the Identification of DMRs subsection in https://www.nature.com/articles/mp2013114.pdf

gabrielodom commented 5 years ago

We have three components to analyse here:

CoxPH Model overall p-values (PermTestSurv())
GLM overall p-values (PermTestCateg())
LM overall p-values (PermTestReg())

Based on the results in the testing files Test_PermTestSurv_Parametric.R and Test_PermTestCateg_Parametric.R, the Pearson correlation between the raw parametric and raw permutation-based p-values for CoxPH and GLM is very high (0.95 or better, even for 100 replicates). However, the FDRs in the GLM case are not as strongly correlated (0.67 for 100 replicates, 0.85 for 1000). I do not yet have results for the LM case.

lxw391 commented 5 years ago

@gabrielodom Please update function using anova(true_mod, test = "LRT")

Also please change AIC to anova function instead to perform LRT test.

gabrielodom commented 5 years ago

Because this is not related to the parametric p-value estimation discussion, I've created a new issue: Issue #57.

gabrielodom commented 5 years ago

For the LM, see Test_PermTestReg_Parametric.R.

Now, all three methods support parametric p-value estimation. To estimate the pathway p-values parametrically, set numReps = 0 in the AESPCA_pVals() function call. Overall, the raw p-values between the parametric and non-parametric options are nearly perfectly correlated (rho > 0.98 even for 100 replicates).

gabrielodom commented 5 years ago

As we see in the attached image, test size does not appear to be a major concern.