kogalur / randomForestSRC

DOCUMENTATION:
https://www.randomforestsrc.org/
GNU General Public License v3.0
115 stars 18 forks source link

Crashing rfsrc.fast and rfsrc #224

Open RacioBR opened 2 years ago

RacioBR commented 2 years ago

Hi,

I am trying to develop a Random Forest for a strong imbalanced data (99,7% vs 0,3% sucess) with 205.546 observations and 35 variables. I am using Xubuntu (see some Hardware info below) and when I tried to run the following command:

rfsrc.fast(DependentVariable ~ .,dataUsed, ntree = 1000, perf.type = "gmean", do.trace = TRUE, statistics = TRUE)

I received the following output:

"Trees Grown: 1, Time Remaining (sec): 999 Trees Grown: 82, Time Remaining (sec): 34 Trees Grown: 165, Time Remaining (sec): 26 Trees Grown: 242, Time Remaining (sec): 22 Trees Grown: 323, Time Remaining (sec): 19 Trees Grown: 402, Time Remaining (sec): 17 Trees Grown: 483, Time Remaining (sec): 14 Trees Grown: 564, Time Remaining (sec): 12 Trees Grown: 642, Time Remaining (sec): 10 Trees Grown: 723, Time Remaining (sec): 8 Trees Grown: 802, Time Remaining (sec): 6 Trees Grown: 884, Time Remaining (sec): 4 Trees Grown: 966, Time Remaining (sec): 1

RF-SRC: ERROR RF-SRC: Stack imbalance in PROTECT/UNPROTECT: 5 + 1 versus 7
RF-SRC: Please Contact Technical Support.<simpleError in doTryCatch(return(expr), name, parentenv, handler): RF-SRC: The application will now exit.

Error in rfsrc(data = list(RegiaoBrasil = c(1L, 1L, 1L, 1L, 1L, 1L, 1L, : An error has occurred in the grow algorithm. Please turn trace on for further analysis."

Apparently, some problem with C. I used RStudio initially and got mostly the same output using Rscript in Terminal (only adding "Calls: rfsrc.fast -> do.call -> rfsrc Execution halted")

Any clue how to solve this?

Thanks in advance.

Rinaldo.

-------------------------------------------------- x -------------- --- Hardware Info --- description: Notebook product: 20ANCTO1WW (LENOVO_MT_20AN_BU_Think_FM_ThinkPad T440p) vendor: LENOVO version: ThinkPad T440p serial: PB01HAWN width: 64 bits capabilities: smbios-2.7 dmi-2.7 smp vsyscall32 configuration: administrator_password=disabled chassis=notebook family=ThinkPad T440p power-on_password=disabled sku=LENOVO_MT_20AN_BU_Think_FM_ThinkPad T440p uuid=019CBBC6-8953-CB11-974E-B36EE3F1F2CD -core description: Motherboard product: 20ANCTO1WW vendor: LENOVO physical id: 0 version: SDK0E50510 PRO serial: L1HF42E0315 slot: Not Available -cpu description: CPU product: Intel(R) Core(TM) i7-4700MQ CPU @ 2.40GHz vendor: Intel Corp. physical id: 0 bus info: cpu@0 version: Intel(R) Core(TM) i7-4700MQ CPU @ 2.40GHz serial: None slot: CPU Socket - U3E1 size: 2397MHz capacity: 3400MHz width: 64 bits clock: 100MHz capabilities: lm fpu fpu_exception wp vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp x86-64 constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm cpuid_fault epb invpcid_single pti ssbd ibrs ibpb stibp fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid xsaveopt dtherm ida arat pln pts md_clear flush_l1d cpufreq configuration: cores=4 enabledcores=4 threads=8 -cache:0 description: L1 cache physical id: 2 slot: L1-Cache size: 32KiB capacity: 32KiB capabilities: asynchronous internal write-back instruction configuration: level=1 -cache:1 description: L2 cache physical id: 3 slot: L2-Cache size: 256KiB capacity: 256KiB capabilities: asynchronous internal write-back unified configuration: level=2 -cache:2 description: L3 cache physical id: 4 slot: L3-Cache size: 6MiB capacity: 6MiB capabilities: asynchronous internal write-back unified configuration: level=3 -cache description: L1 cache physical id: 1 slot: L1-Cache size: 32KiB capacity: 32KiB capabilities: asynchronous internal write-back data configuration: level=1 -memory description: System Memory physical id: 5 slot: System board or motherboard size: 8GiB -bank:0 description: SODIMM DDR3 Synchronous 1600 MHz (0,6 ns) product: M471B5173DB0-YK0 vendor: Samsung physical id: 0 serial: 377D7043 slot: ChannelA-DIMM0 size: 4GiB width: 64 bits clock: 1600MHz (0.6ns) -bank:1 description: SODIMM DDR3 Synchronous 1600 MHz (0,6 ns) product: M471B5173DB0-YK0 vendor: Samsung physical id: 1 serial: 377D7076 slot: ChannelB-DIMM0 size: 4GiB width: 64 bits clock: 1600MHz (0.6ns) -firmware description: BIOS vendor: LENOVO physical id: 35 version: GLET89WW (2.43 ) date: 07/24/2017 size: 128KiB capacity: 12MiB capabilities: pci pnp upgrade shadowing cdboot bootselect acpi usb biosbootspecification uefi -pci description: Host bridge product: Xeon E3-1200 v3/4th Gen Core Processor DRAM Controller vendor: Intel Corporation physical id: 100 bus info: pci@0000:00:00.0 version: 06 width: 32 bits clock: 33MHz -pci:0 description: PCI bridge product: Xeon E3-1200 v3/4th Gen Core Processor PCI Express x16 Controller vendor: Intel Corporation physical id: 1 bus info: pci@0000:00:01.0 version: 06 width: 32 bits clock: 33MHz capabilities: pci pm msi pciexpress normal_decode bus_master cap_list configuration: driver=pcieport resources: irq:24 *-pci:1 description: PCI bridge product: Xeon E3-1200 v3/4th Gen Core Processor PCI Express x8 Controller vendor: Intel Corporation physical id: 1.1 bus info: pci@0000:00:01.1 version: 06 width: 32 bits clock: 33MHz capabilities: pci pm msi pciexpress normal_decode bus_master cap_list configuration: driver=pcieport resources: irq:25 ioport:4000(size=4096) memory:f0000000-f0ffffff ioport:c0000000(size=301989888) -------------------------- x --------------

ishwaran commented 2 years ago

Remove the option for statistics. Try the following and see if you still get this error:

rfsrc.fast(DependentVariable ~ .,dataUsed, ntree = 1000, perf.type = "gmean")

RacioBR commented 2 years ago

Remove the option for statistics. Try the following and see if you still get this error:

rfsrc.fast(DependentVariable ~ .,dataUsed, ntree = 1000, perf.type = "gmean")

Worked. Thanks.