Closed sibipx closed 2 years ago
Looks like this is a data.table scoping problem that occurs in completeData. Very weird this occurs for the carat
column and not for any others.
I've pushed a fix, version 1.5.1. It's not on CRAN yet. @sibipx can you install from github and ensure you see it is fixed too?
it looks fine now, thanks!
> miceRanger_imp_model <- miceRanger::miceRanger(data_train_miss, m = 2, maxiter = 2,
+ valueSelector = "meanMatch",
+ returnModels = TRUE,
+ verbose = TRUE)
Process started at 2022-05-20 11:02:43
dataset 1
iteration 1 | carat | cut | color | clarity | depth | table | price | x | y | z
iteration 2 | carat | cut | color | clarity | depth | table | price | x | y | z
dataset 2
iteration 1 | carat | cut | color | clarity | depth | table | price | x | y | z
iteration 2 | carat | cut | color | clarity | depth | table | price | x | y | z
> data_test_imp_miceRanger <- miceRanger::impute(data_test_miss,
+ miceRanger_imp_model, verbose = TRUE)
dataset 1
iteration 1 | carat | cut | color | clarity | depth | table | price | x | y | z
iteration 2 | carat | cut | color | clarity | depth | table | price | x | y | z
dataset 2
iteration 1 | carat | cut | color | clarity | depth | table | price | x | y | z
iteration 2 | carat | cut | color | clarity | depth | table | price | x | y | z
This problems happens on diamonds dataset and I am unsure why. The problem only happens when valueSelector = "meanMatch". It works fine with valueSelector = "value".
If there is any way to workaround this problem (other than not using PMM), please let me know.
See example below.
Thanks!