Open marcandre259 opened 2 days ago
Yes, this looks wrong! I'll take a look later. Thanks for reporting!
At second thought, this might not necessarily be a bug, for two reasons:
Will have to think about this more - took a look at the code & it looked mostly fine, though will have to check again. Width of the sampling interval differences looks indeed suspicious.
Hi @s3alfisc,
Based on testing the sharp hypothesis with randomization inference, I would expect the boostrap approach to be the less conservative one then <- edit: actually I'd expect the opposite, since it should be easier to reject for at least one i than the average. Nevertheless, the paper blow gives the counterintuitive result that the randomization (sharp) approach is less powerful (paradox)
I'm quickly peeking in this paper that confirms this with simulations in table 1.
As far as progress on #698, I'll get back to including RI for Westfall-Young now that this is open.
Possible issue I noticed while working on #698.
The behavior was initially noticed when comparing "wild-bootstrap" to the "ri" sample_method p-values when the parameter of interest has no association with the outcome.
With the tight null t-distribution, the resulting p-value is too small.
To reproduce:
Estimation: OLS Dep. var.: Y, Fixed effects: 0 Inference: iid Observations: 998
RMSE: 2.304 R2: 0.0