The model build fails with the following message:
c50 code called exit with value 1
summary(model) produces the following:
Call:
C5.0.formula(formula = form, data = indata, trials = 1, control
= C5.0Control(subset = FALSE, winnow = TRUE, noGlobalPruning = FALSE))
C5.0 [Release 2.07 GPL Edition] Sat Jul 31 11:52:01 2021
-------------------------------
*** line 7 of `undefined.names': missing name or value before `,'
Error limit exceeded
The value of model$names is:
[1] "| Generated using R version 4.0.5 (2021-03-31)\n| on Sat Jul 31 11:59:19 2021\noutcome.\n\noutcome: 0,1.\nage: continuous.\njob: management,technician,entrepreneur,blue-collar,unknown,retired,admin.,services,,self-employed,unemployed,housemaid,student.\nmarital: ,single,married,divorced.\neducation: tertiary,secondary,unknown,primary,.\ndefault: continuous.\nbalance: continuous.\nhousing: yes,,no.\nloan: continuous.\ncontact: unknown,cellular,telephone.\nday: continuous.\nmonth: may,,jun,jul,aug,oct,nov,dec,jan,feb,mar,apr,sep.\nduration: continuous.\ncampaign: continuous.\npdays: continuous.\nprevious: continuous.\npoutcome: unknown,failure,other,success.\n"
Changing the value of indataName to "bankmkt_part1.csv" (which has no missing values) lets the model build normally. A zip file containing the R script and the two datasets is attached here.
c50bug.zip
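The names file shown above contains empty level names (for example `housing: yes,,no.` and `marital: ,single,married,divorced.`), which matches the "missing name or value before `,'" message. One possible reading, not confirmed here, is that blank cells in the CSV were read as the empty string `""` and became a factor level. The following is a minimal base-R sketch for checking that; the helper `find_blank_levels` and the toy data frame are hypothetical and stand in for the real dataset.

```r
# Hypothetical helper: names of factor/character columns that contain "" values.
find_blank_levels <- function(df) {
  is_cat <- vapply(df, function(x) is.factor(x) || is.character(x), logical(1))
  has_blank <- vapply(
    df[is_cat],
    function(x) any(as.character(x) == "", na.rm = TRUE),
    logical(1)
  )
  names(has_blank)[has_blank]
}

# Toy data for illustration only (not taken from the attached datasets).
toy <- data.frame(
  housing = factor(c("yes", "", "no")),
  marital = factor(c("", "single", "married")),
  age     = c(30, 41, 52)
)

blank_cols <- find_blank_levels(toy)

# Recode "" as NA and rebuild the factor so the empty level disappears.
clean <- toy
for (col in blank_cols) {
  x <- as.character(clean[[col]])
  x[x == ""] <- NA
  clean[[col]] <- factor(x)
}
```

If the blank level does come from empty CSV cells, reading the file with `read.csv(indataName, na.strings = c("", "NA"))` would be another way to turn them into `NA` at load time. Whether C5.0 then builds on the failing dataset has not been verified here.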