stanfordmlgroup / ngboost

Natural Gradient Boosting for Probabilistic Prediction

Potential bug in score method for sklearn API #218

Closed · vdrao closed this issue 3 years ago

vdrao commented 3 years ago

Hello,

Thank you for NGBoost! I think there may be a bug in the sklearn-compatible score method of the main NGBoost class (lines 317-318):

    def score(self, X, Y):  # for sklearn
        return self.Manifold(self.pred_dist(X)._params).total_score(Y)

Sklearn hyperparameter tuning functions like RandomizedSearchCV expect that better models are assigned higher (more positive) scores. However, as far as I can see, the total_score method above actually computes the total log loss, which means that smaller values correspond to better models. Should the score instead be the negative log loss?
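
As a possible workaround, one can pass an explicit scoring callable to RandomizedSearchCV instead of relying on the estimator's own score method. This is only a minimal sketch, assuming the default Normal predictive distribution (so that pred_dist(X) exposes logpdf); the parameter values are purely illustrative:

    import numpy as np
    from sklearn.datasets import make_regression
    from sklearn.model_selection import RandomizedSearchCV
    from ngboost import NGBRegressor

    def mean_log_likelihood(estimator, X, y):
        # Mean predictive log-likelihood: higher is better, which matches
        # sklearn's convention. Assumes the predictive distribution exposes
        # logpdf (true for the default Normal).
        return np.mean(estimator.pred_dist(X).logpdf(y))

    X, y = make_regression(n_samples=200, n_features=5, noise=1.0, random_state=0)

    search = RandomizedSearchCV(
        NGBRegressor(verbose=False),
        param_distributions={
            "n_estimators": [100, 300, 500],     # illustrative values
            "learning_rate": [0.01, 0.05, 0.1],
        },
        scoring=mean_log_likelihood,  # explicit "higher is better" scorer
        n_iter=5,
        cv=3,
        random_state=0,
    )
    search.fit(X, y)
    print(search.best_params_, search.best_score_)

With scoring supplied explicitly, the search's orientation no longer depends on how NGBoost's own score method is defined.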

alejandroschuler commented 3 years ago

Hi @vdrao, the method you mention actually calculates the negative log likelihood (when using the LogScore scoring rule). The correct sign is enforced in the definition of the score method for each distribution, e.g. for the Normal distribution.
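
A quick way to see the orientation in practice, for the version quoted above, is to compare the estimator's score against the per-sample negative log likelihood taken directly from the predictive distribution. This is a sketch, assuming the default Normal distribution (so that logpdf is available); the exact aggregation inside total_score (sum vs. mean) may differ by version:

    import numpy as np
    from sklearn.datasets import make_regression
    from ngboost import NGBRegressor

    X, y = make_regression(n_samples=200, n_features=5, noise=1.0, random_state=0)
    model = NGBRegressor(verbose=False).fit(X, y)

    # Per-sample negative log likelihood from the predictive distribution
    # (LogScore orientation: lower is better).
    nll = -model.pred_dist(X).logpdf(y)

    # NGBoost's sklearn-style score should agree with the NLL above up to
    # aggregation, i.e. it is also oriented "lower is better".
    print(model.score(X, y), nll.mean(), nll.sum())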