init impl - Githubissues

csinva / imodels

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

MIT License

1.35k stars, 120 forks

@csinva I added support for categorical variables for FIGS.

The interface: the user specifies the names of the categorical columns (in this case we assume X is a pd.DataFrame). I added a function, encode_categories, in imodels.util.data_util that one-hot encodes the data matrix and saves the encoder. Because the saved encoder is reused at inference time, the encoded matrix keeps the same dimensions even when only some of the categories appear in the inference data. I also added a basic test for it.

The Clalit team asked for this functionality. Let me know what you think!


init impl #151