-
Right now we have KNN, KRR, and GradientBoosting. We need to try more methods to develop a sense of which ones suit different types of problems. Let's make the list here.
- RandomForest
- Stoch…
-
Had someone show me this error on their dataset and I was able to duplicate it with some existing data in qiita. Just trying to run a random forest sample classifier on one categorical data with a non…
-
I’ve been playing with dask for a while and, as an incremental model-fitting learning exercise, have made some extensions to the sklearn forest ensembles. Basically, it’s the addition of a .partial_fit(…
-
Sometimes one would like to use a control sample (e.g. because it is more abundant) to determine MC weights that are then applied to other, rarer samples.
For this reason it would be very useful if h…
-
SHAP~HEAD
- This issue ONLY arises for certain random datasets (e.g. if we change the seed sometimes it works!)
- Additionally, if we comment out the model.predict_proba() step, there is also no e…
-
The parameters **subsample** and **max_features** in GradientBoostingRegressor are useful. Is it possible to add equivalent parameters to HistGradientBoostingRegressor?
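
For context, a minimal sketch of how the two parameters are used on the existing `GradientBoostingRegressor`. The synthetic dataset and the specific values (0.5 for both) are assumptions chosen only for illustration; they are not taken from the issue.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor

# Synthetic data, used only to make the example self-contained.
X, y = make_regression(n_samples=200, n_features=10, noise=0.1, random_state=0)

model = GradientBoostingRegressor(
    n_estimators=20,
    subsample=0.5,     # each tree is fit on a random 50% of the rows
    max_features=0.5,  # each split considers a random 50% of the features
    random_state=0,
)
model.fit(X, y)
preds = model.predict(X)
```

With `subsample` below 1.0 the fit becomes stochastic gradient boosting, and `max_features` adds per-split feature subsampling on top of that.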
-
#### Describe the bug
`AdaBoostClassifier.feature_importances_` makes a weighted average of importances...
https://github.com/scikit-learn/scikit-learn/blob/9b7ff272534f130893e95933db46a3ff29519…
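
To make "weighted average of importances" concrete, here is a small pure-Python sketch. The function name `weighted_feature_importances` and the numbers are hypothetical and only illustrate the arithmetic; this is not the library's code.

```python
def weighted_feature_importances(per_estimator_importances, estimator_weights):
    """Average per-estimator importance vectors, weighted by estimator weight."""
    total = sum(estimator_weights)
    if total == 0:
        raise ValueError("estimator weights sum to zero")
    n_features = len(per_estimator_importances[0])
    return [
        sum(w * imp[j] for w, imp in zip(estimator_weights, per_estimator_importances)) / total
        for j in range(n_features)
    ]

# Hypothetical example: two estimators, two features.
importances = [[0.5, 0.5], [1.0, 0.0]]
weights = [1.0, 3.0]
combined = weighted_feature_importances(importances, weights)
```

In this toy case the second estimator carries three times the weight of the first, so the combined importances lean toward its view of the features.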
-
While HistGradientBoostingClassifier is 100 times faster than GradientBoostingClassifier when fitting the model, I found it to be very slow when predicting the class probabilities, in my case about 10…
-
External workflows, e.g. the ones from the training material, should be updated regularly. Or, even better, they should be linked against the training material. Maybe we can pull them down before running t…
-
### ML-Crate Repository (Proposing new issue)
:red_circle: **Project Title** : Medical Recommendation System
:red_circle: **Aim** : A personalized medical recommendation system to assist users in u…