matheusfacure / python-causality-handbook

Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
https://matheusfacure.github.io/python-causality-handbook/landing-page.html
MIT License
2.77k stars 482 forks source link

Question about Chapter 21 - Training data of meta-learners #398

Open agourlaouen-tf opened 5 months ago

agourlaouen-tf commented 5 months ago

Hi,

It is not clear to me what are the conditions on the data used to train meta-learners. The text says: This time, we will use non-random data to train the models and random data to validate them. Dealing with-non random data is a much harder task, because the meta learners will need to debias the data AND estimate the CATE. If I understand correctly, we do not need to ensure that causal inference assumptions such as covariate balance on the training are valid, it is only at inference time, if we want to make sure that what we are computing is indeed the average treatment effect, that we will need to make sure the data is representative of the population the treatment would be applied to. Is that correct ?

Thanks