getting variable importance measure (iml) from a resample result using pipes.

mlr-org / mlr3pipelines

Dataflow Programming for Machine Learning in R

GNU Lesser General Public License v3.0

137 stars 25 forks source link

Hi, thanks for all the work on the ml3 package eco system!

I am trying to calculate variable importance as described here after fitting an xgboost model along these lines:

task = as_task_classif(dt, task_type = "classif", target = "Y")

my_pipe =
  po("scale") %>>%
  po("encode") %>>%
  po("learner", learner =  lrn("classif.xgboost"))

G_learner = GraphLearner$new(my_pipe)

rr = tune_nested(
        task = task,
        learner = G_learner,
        ...
      )

This call returns a "ResampleResult" R6 object. But I am unable to figure out how to calculate variables importance as described here or to find the LearnerClassifXgboost object in order to use the importance method as described here.

I also found this stackoverflow question & answer and based on it unsucessfully tried to calculate variable importance from an autotuner object in the ResampleResult rr.

So my question is: How can I calculate variable importance given a ResampleResult?

mlr-org / mlr3pipelines

getting variable importance measure (iml) from a resample result using pipes. #685