When a DoubleML model is estimated with apply_cross_fitting = FALSE and n_folds = 2, there are misleading entries in the evaluated score functions as well as the exported predictions. Basically for all indices in the test set the entries are correct and also used for estimating the causal paramter(s), etc. However, for all indices which are not part of the test set, the predictions are filled up with zeros. These zero-predictions are then also later used when evaluating the score functions. These entries in psi, psi_a and psi_b are never used but in my view still misleading. In the case at hand, I would propose to fill the predictions and evaluated score function values with NA instead of zeros and non-meaningful values, respectively.
Description
When a DoubleML model is estimated with
apply_cross_fitting = FALSE
andn_folds = 2
, there are misleading entries in the evaluated score functions as well as the exported predictions. Basically for all indices in the test set the entries are correct and also used for estimating the causal paramter(s), etc. However, for all indices which are not part of the test set, the predictions are filled up with zeros. These zero-predictions are then also later used when evaluating the score functions. These entries inpsi
,psi_a
andpsi_b
are never used but in my view still misleading. In the case at hand, I would propose to fill the predictions and evaluated score function values withNA
instead of zeros and non-meaningful values, respectively.Example