broadinstitute / regional_missense_constraint

Code to calculate regional missense constraint
BSD 3-Clause "New" or "Revised" License
12 stars 1 forks source link

Expand MPC code to evaluate regression models with all combinations of variables including BLOSUM/Grantham #289

Closed Ruchit10 closed 1 year ago

Ruchit10 commented 1 year ago

Major Changes:

  1. Include BLOSUM and Grantham scores into pool of variables for running single + joint regressions.
  2. Evaluate linear regression models for all possible combinations of variables including MPC + BLOSUM + Grantham and pick one with lowest AIC.

Minor changes:

  1. Update documentation of additional_variables in function description run_regressions since it will now be added into combined variable pool for single + joint regressions.
  2. Fix a bug causing single variables regression model with lowest AIC to output the model parameters instead of stats model object.
  3. Replace all_model_aic with just the lowest AIC from each category of regression (single, additive, multiplicative and special) since having a long list of AICs from each combination is irrelevant in downstream steps.