Closed sergeyf closed 8 years ago
Erm, maybe I'm being dumb here. BiScaler learns row means, but the test set has totally different rows. Is this thing really not applicable to non-transductive cases? Do you just have to apply it separately to X_train and X_test? That is, do fit_transform twice? That seems... wrong?
I think that the only way to apply non-transductively is to biscale a training set, save the feature parameters (mean/std) and then row scale the test set.
Should that be the default behavior of transform
? This might make inverse_transform
particularly confusing since it could only be used on the most recent dataset for transform
was called.
I wonder if this kind of non-transductive application would totally ruin predictive power... Best to just leave it alone I think
I tried to stick
BiScaler
into a scikit-learn pipeline as follows:But it doesn't seem to work. I get the following error:
when trying this code: