Closed by kitkhai 3 months ago
The `estimator` argument takes the estimator you would like to use.
If you're hoping to estimate the population mean, the standard estimator would be the sample mean.
So the estimator should be `lambda y: y.mean()`, since the sample mean is the estimator of the population mean.
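To make the difference concrete, here is a minimal sketch using plain NumPy (it does not call the library). The array `y` and the names `identity` and `sample_mean` are made up for illustration; the point is only that an estimator should reduce a sample to a single number.

```python
import numpy as np

# Hypothetical toy sample, purely for illustration.
y = np.array([1.0, 2.0, 3.0, 6.0])

# Returns the data unchanged: one value per observation, not an estimate.
identity = lambda y: y

# Reduces the sample to a single number: the sample mean.
sample_mean = lambda y: y.mean()

print(identity(y).shape)   # (4,)
print(sample_mean(y))      # 3.0
```

With `lambda y: y` there is no summary statistic to build an interval around, which is why the sample mean is the estimator to pass when the target is the population mean.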
That works! Thanks!
However, looking at the output, there are some things I don't really understand...
The confidence interval of the mean is `(-0.0032521653800733795, 0.0038037445287558255)`, which does not include the mean of my `Yhat_unlabeled`, which is 2.
`Y` & `Yhat`)?

It should contain the mean of `Y`, not the mean of `Yhat`.
The `Yhat` variable is the synthetic (ML-generated) data, and it doesn't have the "right" mean. The `Y` data represents the small gold-standard dataset. So:
- The interval need not contain the mean of `Yhat`. It should contain the mean of `Y` with high probability.
- With probability `alpha` this will fail to happen, but `alpha` is usually set to be small (e.g. 0.1).
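A small simulation may help show why the interval sits near the mean of `Y` rather than the mean of `Yhat`. This is only a sketch under stated assumptions: the data are simulated, the predictions are given an artificial bias of 2, and `pp_mean` and `bootstrap_ci` are hypothetical helpers implementing a simple bias-corrected mean with a percentile bootstrap. It is a stand-in for illustration, not the library's own implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
n, N = 500, 5000   # labeled and unlabeled sample sizes (assumed)
bias = 2.0         # artificial bias of the ML predictions (assumed)

# Gold-standard labels with true mean 0, and biased predictions for them.
Y = rng.normal(0.0, 1.0, size=n)
Yhat = Y + bias + rng.normal(0.0, 0.1, size=n)

# Unlabeled set: only the predictions would be available in practice.
Y_unlabeled = rng.normal(0.0, 1.0, size=N)
Yhat_unlabeled = Y_unlabeled + bias + rng.normal(0.0, 0.1, size=N)

def pp_mean(Y, Yhat, Yhat_unlabeled):
    # Mean of the predictions, corrected by the average error
    # measured on the labeled data.
    return Yhat_unlabeled.mean() + (Y - Yhat).mean()

def bootstrap_ci(Y, Yhat, Yhat_unlabeled, alpha=0.1, n_resamples=500):
    # Percentile bootstrap: resample labeled pairs and unlabeled
    # predictions independently, then take empirical quantiles.
    stats = np.empty(n_resamples)
    for b in range(n_resamples):
        i = rng.integers(0, len(Y), size=len(Y))
        j = rng.integers(0, len(Yhat_unlabeled), size=len(Yhat_unlabeled))
        stats[b] = pp_mean(Y[i], Yhat[i], Yhat_unlabeled[j])
    lo, hi = np.quantile(stats, [alpha / 2, 1 - alpha / 2])
    return lo, hi

theta = pp_mean(Y, Yhat, Yhat_unlabeled)
lo, hi = bootstrap_ci(Y, Yhat, Yhat_unlabeled)

print("mean of Yhat_unlabeled:", Yhat_unlabeled.mean())
print("bias-corrected estimate:", theta)
print("90% interval:", (lo, hi))
```

In this setup the predictions average about 2, while the corrected estimate and its interval sit near 0, the mean of the gold-standard `Y`. That mirrors the pattern reported above: an interval that excludes the mean of `Yhat_unlabeled` is expected when the predictions are biased.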
Hi, I don't really understand how to use the `PPBoot` function when I want to calculate the CI of the mean.
P.S. I used `lambda y: y` as an estimator, as I'd rather not have to run my machine learning/LLM model to get the labels again. Or must I pass the model for it to do inference?