Open kjappelbaum opened 2 years ago
Perhaps there is no need, as there's already https://github.com/omixlab/y-scramble.
However, they seem to scramble only the train set (?) and leave the test set intact.
Perhaps this would be an interesting benchmark #244
more details here https://www.jmlr.org/papers/volume11/ojala10a/ojala10a.pdf
I think a valid implementation would be the following:
In this way, we have a bootstrapped effect size.
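A minimal sketch of how such a check could look. This is an assumption on my side, not the code of either linked package. For repeated random splits, the model is scored once with the real labels and once with labels permuted across the whole dataset (so train and test are both scrambled), and the score differences are collected. `y_scramble`, `r2_of_line_fit`, and all parameter names are hypothetical, and the toy line fit only stands in for a real model. The p-value uses the usual (k + 1) / (n + 1) permutation-test convention.

```python
import random
from statistics import mean


def r2_of_line_fit(x_train, y_train, x_test, y_test):
    """Toy stand-in model: least-squares line fitted on train, R^2 on test."""
    mx, my = mean(x_train), mean(y_train)
    sxx = sum((x - mx) ** 2 for x in x_train)
    slope = sum((x - mx) * (y - my) for x, y in zip(x_train, y_train)) / sxx
    intercept = my - slope * mx
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(x_test, y_test))
    y_bar = mean(y_test)
    ss_tot = sum((y - y_bar) ** 2 for y in y_test)
    return 1.0 - ss_res / ss_tot


def y_scramble(x, y, score_fn, n_rounds=200, test_fraction=0.25, seed=0):
    """Compare scores on real labels with scores on permuted labels (hypothetical helper)."""
    rng = random.Random(seed)
    n = len(y)
    n_test = max(2, int(n * test_fraction))
    true_scores, null_scores = [], []
    for _ in range(n_rounds):
        # New random train/test split in every round.
        idx = list(range(n))
        rng.shuffle(idx)
        test, train = idx[:n_test], idx[n_test:]

        # Permute the labels over the whole dataset, so both the
        # train and the test labels are scrambled (assumption).
        y_perm = list(y)
        rng.shuffle(y_perm)

        def split(labels):
            return (
                [x[i] for i in train], [labels[i] for i in train],
                [x[i] for i in test], [labels[i] for i in test],
            )

        true_scores.append(score_fn(*split(y)))
        null_scores.append(score_fn(*split(y_perm)))

    # Per-round difference between real and scrambled score.
    diffs = [t - s for t, s in zip(true_scores, null_scores)]
    observed = mean(true_scores)
    # Fraction of scrambled runs that score at least as well as the real labels.
    p_value = (sum(s >= observed for s in null_scores) + 1) / (n_rounds + 1)
    return {"effect_size": mean(diffs), "p_value": p_value, "diffs": diffs}


# Usage on synthetic data with a real x-y relationship.
data_rng = random.Random(1)
x = [float(i) for i in range(40)]
y = [2.0 * xi + data_rng.gauss(0.0, 1.0) for xi in x]
result = y_scramble(x, y, r2_of_line_fit)
print(result["effect_size"], result["p_value"])
```

The spread of `result["diffs"]` gives a rough uncertainty on the effect size. Scrambling only the train labels instead would be a one-line change in `split`.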
probably best if we make a dedicated small package for this
giving it a shot here https://github.com/kjappelbaum/yscrambler
https://pubs.acs.org/doi/10.1021/ci700157b
https://onlinelibrary.wiley.com/doi/epdf/10.1002/qsar.200390007
the most prominent use I know https://www.science.org/doi/10.1126/science.aat8603
some discussion about the usefulness here https://stat.ethz.ch/pipermail/r-help/2010-March/230856.html, in particular
Which makes sense to me