Closed mdbenito closed 9 months ago
@BastienZim I've worked a bit on your notebook, let me know if you have comments. I was a bit surprised by the very different results that one can obtain with different seeds, often obtaining a degradation of performance with the removal of the worst 20% points. In the end I added complete randomization of the whole run including the splitting of the dataset to see what the true variance is. It is a lot more, but things are predictable. I also added random seed handling to compute_oob and did a couple minor things here and there
Description
This PR adds some text and supporting functions to the OOB notebook.
Changes
compute_data_oob
oob.py
I also sneaked in a couple of unrelated things:
Checklist
"tags": ["hide"]
or"tags": ["hide-input"]