Closed inti4digbi closed 8 months ago
Hey @inti4digbi,
Thank you for this issue provided. Indeed, I don't think this is something that could be fixed within our framework. I believe that you could use a class wrapper and re-model the predict function to output only the prediction (checkout #340).
Also, I am not an expert with regard to CatBoost and using the loss_function
with RMSEWithUncertainty
but if you only plan to use the prediction and not the uncertainty estimates, is that not simply using the loss_function
with RMSE
?
Do not hesitate to ask if you have any further questions,
Thank you!
Thank you
On Thu, Mar 14, 2024 at 9:38 AM Louis Lacombe @.***> wrote:
Hey @inti4digbi https://github.com/inti4digbi,
Thank you for this issue provided. Indeed, I don't think this is something that could be fixed within our framework. I believe that you could use a class wrapper and re-model the predict function to output only the prediction (checkout #340 https://github.com/scikit-learn-contrib/MAPIE/issues/340).
Also, I am not an expert with regard to CatBoost and using the loss_function with RMSEWithUncertainty but if you only plan to use the prediction and not the uncertainty estimates, is that not simply using the loss_function with RMSE? Do not hesitate to ask if you have any further questions,
Thank you!
— Reply to this email directly, view it on GitHub https://github.com/scikit-learn-contrib/MAPIE/issues/423#issuecomment-1997362621, or unsubscribe https://github.com/notifications/unsubscribe-auth/AWUFMWHEXZTI2BQ4RHP6GYDYYGK6BAVCNFSM6AAAAABEM2BFQCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSOJXGM3DENRSGE . You are receiving this because you were mentioned.Message ID: @.***>
Describe the bug
CatBoost has the
loss_function
RMSEWithUncertainty
which uses estimated variance for the predictions during training. I have found it to perform better thanRMSE
on some applications. However, I do not want to use the uncertainty estimates it provides.When using the models with MAPIE
I get the error message i have paste below. I checked on the code and found the reason for the error. When using the
predict
method in a models with theRMSEWithUncertainty
loss the output has has shape(n_samples,2)
with the columns being the predicted mean and predicted variance. MAPIE expects shape(n_samples,)
.I can imagine a simple solution where if the output has more than 1 columns then the first one is used as the predicted mean. However, I can see how this could break in cases where the
predict
methods produce columns arranged in a different manner depending of whatever reason the developers decided.It would be great if you could support this.
Many thanks in advance
To Reproduce Steps to reproduce the behavior:
Expected behavior A clear and concise description of what you expected to happen.
Screenshots If applicable, add screenshots to help explain your problem.
Desktop (please complete the following information):
Additional context Add any other context about the problem here.