guillermo-navas-palencia / optbinning

Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
http://gnpalencia.org/optbinning/
Apache License 2.0
434 stars 98 forks source link

The feature of numpy parallel can improve the efficiency of variable transform #318

Closed cfkstat closed 1 month ago

cfkstat commented 1 month ago

When I transform large amounts of data, such as more than 2 million, the transform efficiency is relatively low, and the python package dpnp supports parallelism。

guillermo-navas-palencia commented 1 month ago

Hi @cfkstat.

Thanks for proposing this enhancement. I understand that transform for large datasets might be slow, but dpnp is an additional dependency which might only benefit Intel processors. Therefore, I discard its usage.