-
I believe feature importance was recently implemented in the underlying python code - would it be possible to add access to it?
-
Here is my code:
X_train_1, y_train_1 = load_svmlight_file('test.txt')
clf = RandomForestClassifier(n_estimators=10, n_jobs=-1, class_weight="balanced")
clf = clf.fit(X_train_1, y_train_1)
from sklea…
-
I'm using a big test dataset (12500 k rows) for prediction. `predict_proba` takes about one day using autosklearn model, formed by only two pipelines, meanwhile it's takes only 4 hours with Random For…
-
Hi,
I am using dragnet in combination with scikit-learn. I would like to use scikit-learn 0.18 together with dragnet, which is a little bit unconvenient, since dragnet depends on an older version o…
-
I am using Ensemble Classifiers from the package and was trying to create dynamic selection classifier from the following example. So my code snippet is like this:
```
# Initializing ensemble of dif…
-
[ 0.000000] Linux version 4.19.9 (vagrant@ubuntu-16) (gcc version 7.4.0 (OpenWrt GCC 7.4.0 r1164-730afc3)) #0 SMP Tue Jan 15 02:39:04 2019
[ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz roo…
-
I just noticed that there are some requests for integration with PySpark http://dmlc.ml/2016/03/14/xgboost4j-portable-distributed-xgboost-in-spark-flink-and-dataflow.html
I also received some emails …
-
I'm doing binary classification on a large dataset. This is what my code looks like.
```
clf = catboost.CatBoostClassifier(thread_count=20, verbose=True, iterations=100, l2_leaf_reg=3)
clf.fit(df…
-
I have thought to how make more people use Xgboost R package, and I think more documentation may help.
Lots of R package have a Vignette too light (copy paste of the R functions documentation) or to…
-
Hi,
I would like to access to several levels of detailed information from the leaf nodes of a RF classifier.
Right after training, I would need to get the training samples which end up in each node…