Closed forecastingEDs closed 1 year ago
Hello @forecastingEDs 👋
This is cross-posted from https://stackoverflow.com/questions/76507745/conflicts-or-errors-in-the-tune-and-tidymodels-packages-using-information-gain-f and https://github.com/tidymodels/tidymodels/issues/109
This bug was introduced in https://github.com/stevenpawley/colino/commit/b6f15487029eaabf6a147383d290104d35cd4854#diff-861702901efb987346f4a1cefaad0ac436f9f8a3a6b1385e12a77a32edbca645, this PR should fix that problem. https://github.com/stevenpawley/colino/pull/7
If you urgently need these changes, you can install {colino} from that branch with pak::pak("stevenpawley/colino#7")
Dear @EmilHvitfeldt
Thanks a lot for providing the bug-fixed version. #7 I managed to install using devtools::install_github("stevenpawley/colino#7")
I thank you immensely!** @EmilHvitfeldt
Merged! thanks for the fix and for reporting this!
Hello @stevenpawley , Please help me!
This code was working until yesterday, when I uninstalled the
recipeselectors
andcolino
packages. I've looked everywhere and it looks like other users have recently reported the same error:https://stackoverflow.com/questions/76463339/error-in-purrrmap-unable-to-use-step-select-from-colino-and-workflow/76508232
There seems to be a conflict between
recipeselectors
andcolino
in which they both want to run thestep_select_infgain
function. I already tried deleting one of the two packagesrecipeselectors
orcolino
, but the error remains. Until two days ago my script was working normally, but I needed to install therecipeselectors
andcolino
packages and the error started to appear.Link to download the database used:
https://github.com/forecastingEDs/Forecasting-of-admissions-in-the-emergency-departments/blob/131bd23723a39724ad4f88ad6b8e5a58f42a7960/datasets.xlsx
Repex reproducible example
*** Load the following R packages ----
Preparing data for preprocessing with recipe
Full = Training + Forecast Datasets
Apply Group-wise Time Series Manipulations
Consolidate IDs
Training Data
Forecast Data
** Summary Diagnostics. Let us check the regularity of all time series with timetk::tk_summary_diagnostics()
** Check for summary of timeseries data for training set
Data Splitting ----
Now we set aside the future data (we would only need that later when we make forecast)
And focus on training data
* 4.1 Panel Data Splitting ----
Split the dataset into analyis/assessment set
Feature Selection and preprocessing ----
Information gain feature selection ----
*** Model 4: GLMNET ----
See that if we run the model training without the step of selecting variables for information gain, it works normally generating the grid search. It could be some package conflict that I am not able to identify because the preprocessing with the information gain step was working normally
result
A tibble: 1 × 8
See that if we run the model training without the step of selecting variables for information gain, it works normally generating the grid search. It could be some package conflict that I am not able to identify because the preprocessing with the information gain step was working normally
result
A tibble: 1 × 8