Closed listar2000 closed 1 month ago
You're totally right. My guess as to why this didn't cause an error is that $N$ is very large, so
Yhat_total[n:].mean()
$\approx$ Yhat_total[rand_idx[n:]].mean()
This is a great find. Would you be willing to open a PR that
Thank you so much for your help @listar2000 !
Thanks for the quick response @aangelopoulos and I'm very willing to do that. Will send a PR soon.
Addressed by #19 ; Thank you @listar2000 !
Hi, I have a question regarding how data splitting has been done in the example notebooks. It is pretty obvious so I don't know whether it is a problem or is intentional. Take the
galaxies.ipynb
as example (I think this snippet is reused everywhere though):It seems that the
_Yhat_unlabeled
vector is not constructed through the randomly permuted indices for each trial, while the gold standard_Yhat
does. Essentially no matter how many repetitions are made, the former vector stays the same.