Eventually we'll need to further process X and Y so that
X does not contain object_id
NaN values in X and Y are filled with a token value, like -9999
The positions of filled NaN values in Y are presaved as a mask, so that they are not counted toward the loss (we shouldn't penalize the algorithm for not getting answers that weren't there)
This can be an internal helper method _preprocess_for_training that is called by make_training_sets.
Eventually we'll need to further process
X
andY
so thatX
does not containobject_id
X
andY
are filled with a token value, like -9999Y
are presaved as a mask, so that they are not counted toward the loss (we shouldn't penalize the algorithm for not getting answers that weren't there)This can be an internal helper method
_preprocess_for_training
that is called bymake_training_sets
.