ATOMScience-org / AMPL

The ATOM Modeling PipeLine (AMPL) is an open-source, modular, extensible software pipeline for building and sharing models to advance in silico drug discovery.
MIT License

Changes to control model sparsity and MTSS improvements #331

Open mcloughlin2 opened 4 months ago

mcloughlin2 commented 4 months ago

Improvements to MultitaskScaffoldSplitter:

New module split_diagnostic_plots:

Sparsity-related parameters for XGBoost models:

New search domain parameters for hyperopt optimization of sparsity parameters:

Feature_importance function to draw a line plot of the summed absolute NN feature weights vs. epoch.
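
For readers unfamiliar with this kind of plot, here is a minimal sketch of the idea, not the AMPL implementation. It assumes the first-layer weights are saved once per epoch as a nested list `snapshots[epoch][feature][hidden_unit]`, and that "summed" means summing absolute weights over hidden units for each input feature. The function names `summed_abs_weights` and `plot_summed_abs_weights` are hypothetical and chosen for illustration only.

```python
from typing import List, Sequence


def summed_abs_weights(
    snapshots: Sequence[Sequence[Sequence[float]]],
) -> List[List[float]]:
    """Return sums[epoch][feature] = sum over hidden units of |weight|.

    Assumed layout (not taken from AMPL): snapshots[epoch][feature][unit].
    """
    return [
        [sum(abs(w) for w in feature_row) for feature_row in epoch_weights]
        for epoch_weights in snapshots
    ]


def plot_summed_abs_weights(snapshots, feature_names):
    """Draw one line per feature: summed absolute weight vs. epoch."""
    # Imported lazily so the summation above works without matplotlib.
    import matplotlib.pyplot as plt

    sums = summed_abs_weights(snapshots)
    epochs = list(range(len(sums)))
    fig, ax = plt.subplots()
    for i, name in enumerate(feature_names):
        ax.plot(epochs, [epoch_sums[i] for epoch_sums in sums], label=name)
    ax.set_xlabel("epoch")
    ax.set_ylabel("summed |weight|")
    ax.legend()
    return fig


# Toy data: 2 epochs, 2 input features, 2 hidden units (made-up values).
example_snapshots = [
    [[0.5, -0.5], [1.0, 2.0]],
    [[0.25, -0.25], [1.5, -2.5]],
]
example_sums = summed_abs_weights(example_snapshots)
```

In a plot like this, a feature whose line decays toward zero over training is one the network is effectively ignoring, which is why it can serve as a rough check on sparsity-inducing settings.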

paulsonak commented 3 months ago

Hi, so far I have tested the following, mainly by running the functions through various tutorial notebooks. I will update this comment once I have finished trying out the rest of the features.

Improvements to MultitaskScaffoldSplitter:

Comments:

New module split_diagnostic_plots:

Comments:

Sparsity-related parameters for XGBoost models:

New search domain parameters for hyperopt optimization of sparsity parameters:

Feature_importance