Classification tasks are sometimes not recognized

dandls / counterfactuals

counterfactuals: An R package for Counterfactual Explanation Methods

GNU Lesser General Public License v3.0

21 stars 4 forks source link

I haven't had time to investigate this in detail, but it seems that there are cases where our classification methods do not correctly recognize classification tasks:

library(counterfactuals)
library(iml)
library(dplyr)
library(tidymodels)

data(german, package = "rchallenge")

credit = german[, c("duration", "amount", "purpose", "age",
                    "employment_duration", "housing", "number_credits", "credit_risk")]

x_interest = credit[998L,]
rf = rand_forest(mode = "classification", engine = "randomForest") %>%
  fit(credit_risk ~ ., data = credit[-998L,])

pred = Predictor$new(model = rf, data = credit[-998L,], y = "credit_risk")
nice_classif = MOCClassif$new(pred)
#> Error in super$initialize(predictor, lower, upper, distance_function): MOCClassif only works for classification tasks.

However, if we explicitely set the prob argument in the Predictor$init() method, it works:

library(counterfactuals)
library(iml)
library(dplyr)
library(tidymodels)
data(german, package = "rchallenge")

credit = german[, c("duration", "amount", "purpose", "age",
                    "employment_duration", "housing", "number_credits", "credit_risk")]

x_interest = credit[998L,]
rf = rand_forest(mode = "classification", engine = "randomForest") %>%
  fit(credit_risk ~ ., data = credit[-998L,])

pred = Predictor$new(model = rf, data = credit[-998L,], y = "credit_risk", type = "prob")
nice_classif = MOCClassif$new(pred)

library(iml) library(tidymodels) data(german, package = "rchallenge") credit = german[, c("duration", "amount", "purpose", "age", "employment_duration", "housing", "number_credits", "credit_risk")] # tidymodels rf = rand_forest(mode = "classification", engine = "randomForest") %>% fit(credit_risk ~ ., data = credit) pred = Predictor$new(model = rf, data = credit, y = "credit_risk") pred$task #> [1] "unknown" pred$task = NULL pred$predict(credit[c(1, 2),]) pred$task #> [1] "regression" iml:::inferTaskFromPrediction(prediction = pred$predict(credit[c(1, 2),])) #> [1] "regression"

dandls / counterfactuals

Classification tasks are sometimes not recognized #29