guillermo-navas-palencia / optbinning

Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
http://gnpalencia.org/optbinning/
Apache License 2.0
434 stars 98 forks source link

Better handling dtypes #316

Open lcrmorin opened 2 months ago

lcrmorin commented 2 months ago

For the moment the data type need to be provided manually, accepting 'numerical' or 'categorical', with default being 'numerical'.

For quality of life I would suggest:

Any thoughts on these proposals ?

guillermo-navas-palencia commented 2 months ago

Hi @lcrmorin.

There is a good reason to avoid inferring types directly (although this is done in BinningProcess for obvious reasons). The main problem occurs when dealing with integer variables, there is no automatic process to distinguish between ordinal and categorical.

lcrmorin commented 2 months ago

I feel like: