mnwright closed this issue 4 years ago
Thank you for catching this - and for using the Export() command.
This appears to be repaired in 0.2-3.
Training with and without autocompression should yield nearly identical results. Differences, if there are any, should all be attributable to the ordering of floating point operations.
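To illustrate why the ordering of floating point operations alone can produce small differences, here is a minimal Python sketch. It is not taken from the package: the `accumulate` helper and the sample values are hypothetical, and it only shows that summing the same numbers in a different order can change the last bits of the result.

```python
import math


def accumulate(values):
    """Sum values left to right, the way a plain accumulation loop would."""
    total = 0.0
    for v in values:
        total += v
    return total


# Hypothetical response values; any two traversal orders of the same data would do.
responses = [0.1, 0.2, 0.3]

forward = accumulate(responses)
backward = accumulate(list(reversed(responses)))

# The two sums agree to within rounding error but are not bit-identical.
print(forward == backward)
print(math.isclose(forward, backward, rel_tol=1e-12))

# A more extreme case: a small term is absorbed when added to a large one first.
print(accumulate([1e16, 1.0, -1e16]))
print(accumulate([1e16, -1e16, 1.0]))
```

Differences of this kind are tiny for any single sum. Whether they explain a given divergence between two training modes depends on how far the results actually differ.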
Leaving this open for further testing.
Thanks, seems to be working.
Thank you for confirming.
We see increasing divergence between the two training modes as the number of categories increases. Hawk does not believe this is attributable solely to differences in floating point accumulation, so we need to take a deeper look before closing the issue.
In the case of regression, at least, differences in accuracy between training with and without autocompression appear to be due to floating point accumulation. In particular, neither regime is uniformly superior to the other.
Thank you for the very useful test case.
Please feel free to reopen should the issue re-emerge.
With the default value for `autoCompress`, the R package grows only tree stumps when the x-data is binary. Example: