Closed pidefrem closed 5 months ago
Hello @pidefrem! Thank you for this contribution. I'll take the time to review it. Just to make sure I understand your proposal: in "Allow MapieRegressor to use group split strategy", the split strategy refers to the way the cross-validation is done, while the method itself remains a cross-conformal method.
Don't hesitate to contact me if you need any help.
Hello @thibaultcordier, yes it refers to the split methods of the cross validator used during the fit of the MAPIE estimators. Please feel free to suggest any other description that you think is more suitable.
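To make the point about the cross-validator's split method concrete, here is a minimal sketch of how a cross-conformal style fit could forward `groups` to `cv.split`. It is not MAPIE's code: the helper `out_of_fold_residuals` is a hypothetical name used only for illustration, and only NumPy and scikit-learn calls are used.

```python
import numpy as np
from sklearn.base import clone
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import GroupKFold


def out_of_fold_residuals(estimator, X, y, cv, groups=None):
    """Hypothetical helper: fit one clone per fold, return |y - y_pred| out of fold."""
    residuals = np.full(len(y), np.nan)
    # The splitter decides which samples are held out; groups are forwarded as-is.
    for train_idx, test_idx in cv.split(X, y, groups):
        model = clone(estimator).fit(X[train_idx], y[train_idx])
        residuals[test_idx] = np.abs(y[test_idx] - model.predict(X[test_idx]))
    return residuals


rng = np.random.default_rng(0)
X = rng.normal(size=(12, 2))
y = X @ np.array([1.0, -2.0]) + rng.normal(scale=0.1, size=12)
groups = np.repeat([0, 1, 2], 4)  # three groups of four samples each

residuals = out_of_fold_residuals(
    LinearRegression(), X, y, GroupKFold(n_splits=3), groups=groups
)
```

With `GroupKFold`, every sample is held out exactly once, and the model that scores it was never trained on any sample from the same group.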
All modified and coverable lines are covered by tests :white_check_mark:
Comparison is base (614293e) 100.00% compared to head (b9f25fa) 100.00%. Report is 210 commits behind head on master.
Wonderful! Thank you for this very welcome contribution. I don't have many comments, just a few points to round out your proposal:
- Some code style suggestions.
- As far as the tests are concerned, you do test whether MAPIE returns the same results when every sample belongs to the same group (constant groups or None). But what happens if you use different groups, like
  np.concatenate([np.ones(n_samples // 2), 2 * np.ones(n_samples // 2)])
  (untested proposal)?
- Duplicate the tests in the classification test file, because changes have also been made in that part of the code.
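As a rough illustration of the two-group case raised above, the following sketch builds such a group vector and checks how scikit-learn's `GroupKFold` splits it. The value of `n_samples` and the toy `X` and `y` are assumptions made for the example; the PR's actual tests are not shown here.

```python
import numpy as np
from sklearn.model_selection import GroupKFold

n_samples = 10  # assumed value, for illustration only
half = n_samples // 2

# First half of the samples in group 1, second half in group 2.
groups = np.concatenate([np.ones(half), 2 * np.ones(half)])
X = np.arange(n_samples, dtype=float).reshape(-1, 1)
y = np.arange(n_samples, dtype=float)

folds = list(GroupKFold(n_splits=2).split(X, y, groups))
for train_idx, test_idx in folds:
    # No group may appear on both sides of a split.
    assert set(groups[train_idx]).isdisjoint(groups[test_idx])
```

With two groups and two splits, each test fold holds exactly one group, so results should generally differ from the constant-group or `None` case.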
@thibaultcordier I fixed some issues and added some tests, tell me if it is ok now.
Hello @pidefrem, I'll check your changes this week. Thank you for contacting me about the review. I'll keep you informed.
Description
Continuing the work done in PR
Allow the use of GroupKFold cv-split (see https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.GroupKFold.html) and also the use of custom cv-splits based on StratifiedKFold, for example (see https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.StratifiedKFold.html#sklearn.model_selection.StratifiedKFold).
Fixes #202
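The two splitters mentioned in the description share the same `split(X, y, groups)` interface in scikit-learn, which is what makes them interchangeable as a `cv` argument. The sketch below uses toy data and variable names chosen for illustration only, and does not call MAPIE.

```python
import numpy as np
from sklearn.model_selection import GroupKFold, StratifiedKFold

X = np.zeros((8, 1))
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])
groups = np.array([0, 0, 1, 1, 2, 2, 3, 3])

# GroupKFold needs groups; StratifiedKFold stratifies on y and does not use them.
group_folds = list(GroupKFold(n_splits=2).split(X, y, groups))
stratified_folds = list(StratifiedKFold(n_splits=2).split(X, y))

for train_idx, test_idx in group_folds:
    # Groups never straddle the train/test boundary.
    assert set(groups[train_idx]).isdisjoint(groups[test_idx])

for train_idx, test_idx in stratified_folds:
    # Each test fold keeps the 50/50 class balance of y.
    assert np.bincount(y[test_idx]).tolist() == [2, 2]
```

Which splitter fits depends on the data: group-based splits guard against leakage between related samples, while stratified splits preserve class proportions in each fold.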
Type of change
How Has This Been Tested?
Checklist
make lint
make type-check
make tests
make coverage
make doc