The discretization code currently does independent discretization of test and train set. This is a problem when the test and train sets do not follow same distribution.
Ideally, same bins should be used for both the sets. Saving the bins used while discretization of train set and using the same bins while discretizing the test set could be one solution.
The discretization code currently does independent discretization of test and train set. This is a problem when the test and train sets do not follow same distribution.
Ideally, same bins should be used for both the sets. Saving the bins used while discretization of train set and using the same bins while discretizing the test set could be one solution.