JakeColtman / bartpy

Bayesian Additive Regression Trees For Python
https://jakecoltman.github.io/bartpy/
MIT License
219 stars 44 forks source link

Switch to using numpy masks in nodes rather than full copies of the covariate matrix #38

Closed JakeColtman closed 5 years ago

JakeColtman commented 5 years ago

The main change in this PR is to reshuffle how Data works. Rather than every split causing a deepcopy of the X and y matrices, it now only creates new masks onto a single instance of X and y. This makes the process much faster and lighter on memory

There's also an embarrassing splodge of other changes at the same time :(