Open GoogleCodeExporter opened 9 years ago
the default nodesize is 5 and usually that works best. smaller nodesize will
create larger trees but may not work towards generalization for regression.
nodesize = number of examples in the terminal leaf nodes
search mtry=D/10:D/10:D where D is the number of features and do that couple of
times and average the results. the best mtry should be the one with the lowest
average ooberr
Original comment by abhirana
on 15 May 2012 at 11:07
The default nodesize for classification is 1, correct? and you dont suggest to
change it?
Original comment by m.saleh....@gmail.com
on 15 May 2012 at 11:16
nope i wont suggest changing that because unlike regression, in classification
its always one of the classes. if the nodesize is increased it may cause the
tree to have multiple examples from multiple classes and at the leaf node may
have to resort to finding the winner class and i dont think that is a good
thing to be doing for classification trees
Original comment by abhirana
on 15 May 2012 at 11:19
Original issue reported on code.google.com by
m.saleh....@gmail.com
on 15 May 2012 at 11:02