DoubleML / doubleml-for-r

DoubleML - Double Machine Learning in R
https://docs.doubleml.org
Other
129 stars 25 forks source link

[Feature Request]: Observation weights #192

Open kamielkanfoudi opened 7 months ago

kamielkanfoudi commented 7 months ago

Describe the feature you want to propose or implement

Hi! I am working on using the DML package to try and reproduce previous impact evaluation studies done with OLS to study causal inference. I have a dataset that includes observation weights and I can't seem to find a way to implement this as a parameter when fitting the PLIV model. Resampling the dataset has not seemed to work as the weights are within quite an unconventional range, [0.000004,0.05], and it becomes difficult to expand the data and keep the runtime at reasonable level. I'm not sure if I am missing something in the package where I can include these features. Thank you for your time!

Best,

Kami

Propose a possible solution or implementation

No response

Did you consider alternatives to the proposed solution. If yes, please describe

No response

Comments, context or references

No response

PhilippBach commented 7 months ago

Dear @kamielkanfoudi

thanks for your feature request. We agree that observation weights are very important for applying DoubleML on real data sets. We'll add this feature to the list of future extensions!

Unfortunately, I'm not yet aware of a paper that goes through the weighting adjustments in DoubleML. We'd have to read a bit more on this. The complication that arises in DoubleML as compared, for example, to linear regression is that the weights show up in two separate tasks: First - and this is what I guess you probably have in mind - they have to be accounted for during the estimation of the (causal) regression parameters. Second, they might play a role for the classification and regression ML learners.... I'm not sure, if ignoring them in the 2. step might create issues.

We'll investigate this in further detail and also discuss it with some colleagues who are more familiar with that literature, but adding it to the package might take a little bit of time.

Once more, thank you and best,

Philipp