JakeColtman / bartpy

Bayesian Additive Regression Trees For Python
https://jakecoltman.github.io/bartpy/
MIT License
219 stars 44 forks source link

Feature Selection #33

Closed JakeColtman closed 5 years ago

JakeColtman commented 5 years ago

Resolution for #26

JakeColtman commented 5 years ago

The current approach to selecting feature importance requires generating lots of permutations of the model to produce a null distribution of feature inclusions. At the moment, there's no way to run models in parallel because joblib can't be nested. It would be nice to expose the chains to an external runner, so that all chains from all models can be run at once

JakeColtman commented 5 years ago

Having played around with the current implementation, two things strike:

JakeColtman commented 5 years ago

Still to do - wrap the logic up into an Sklearn BaseFilter

JakeColtman commented 5 years ago

Added to the PR: