Closed ypotdevin closed 2 years ago
Counter measure 3 might be enhanced by recording each tree and the threshold it would have passed (but did not, since the comparison was stricter), and inserting it into the ensemble once (in retrospect) the threshold is loose enough.
Counter measure 2 may be realized by keeping a history of enhancements (prev_loss - current_loss
if current_loss < prev_loss
). Then, check whether the current enhancement is significantly better then previous (recent) enhancements. For example, calculate a (moving) average and a (moving) standard deviation [or their robust versions] and check whether the current enhancement is exceeds avg + n std for some small n*.
In dp_rmse.py, the lines
may yield (large) negative values – unreasonable for rMSE loss values. These may be some viable counter measures:
prev_loss < current_loss
according to a (yet to be determined) schedule step by step, so that new trees get a chance to join the ensemble again (which would otherwise be highly unlikely).