A lightweight training and evaluation framework for reliable determination of synthesizability (EFA) of high entropy materials.
See the associated publication for more information about EFA: https://www.nature.com/articles/s41524-020-0317-6
This project targets a CPU-enabled Linux workstation. Additional work may be required for testing on other operating systems.
We target Python 3.6+.
pip install -e .
This project includes a cli.py
script for training a Random Forest Regressor with the following assumptions:
The cli.py
script returns several artifacts including the fit model, a table of feature importances, the hyperparamter search results and optimal hyperparameters found, and a transform mask (if feature selection is performed).
This project includes a predict.py
script for predicting the EFA of new data. The following assumptions are made:
The predict.py
script returns a .csv file containing the material name and the predicted EFA.
This project includes a predict.py
script for exporting all of the decision trees in a random forest as png files.
This script assumes:
model_path = '../model_checkpoints/model_checkpoint_wCalphad_gs_2019-09-17-02-09.joblib'
transform_mask_init = pd.read_csv('../transform_mask/FILENAME_HERE.csv')
Predictor-Variable-Definitions.pdf