We don't seem to have unit tests being sensitive for the bug fix in the aggregation formula. In my ongoing major update of the unit test framework I will add this extension.
In our R vs. Python package tests we so far didn't had a test case with repeated cross-fitting and therefore the difference between the implementations didn't become visible: I added such a test case in https://github.com/DoubleML/doubleml-py-vs-r/pull/4. As the tests are now sensitive for the aggregation formula, they also fail in the PR which will be resolved when the R package got its bug fix.
I think there is a bug in the aggregation of standard errors from repeated cross-fitting.
Description
se
are not equal to sigma_hat but the scaled asymptotic standard error, i.e.,_all_se
, i.e.,Implementation
Unit Tests
Documentation
_all_se
we don't store the unscaled standard errors sigma_hat_m but the scaled / asymptotic standard errors sigma_hat_m / sqrt(N). We should adapt this accordingly.