Open Klesel opened 1 year ago
@Klesel As I understand it, a p-value range is produced when the test statistic is more extreme than all the samples in the null distribution. I think the intent is to communicate that the p-value was not estimated to be equal to some value, but is less than or greater than a range of values.
It occurs because the p-value can't be estimated using the implemented bootstrap technique; instead we can only say the p-value lies within a range from [zero to n], or alternatively from [n to 1] (either could occur).
@Klesel I added my interpretation of the bootstrap significance test to this issue: https://github.com/py-why/dowhy/issues/929 It is based on reverse-engineering the code.
Using bootstrap samples to test the estimates is ambiguous. Here is how the current output looks like:
Here is a reproducible example:
I raised another issue related to documentation: https://github.com/py-why/dowhy/issues/816 If you prefere, we can merge both issues.