DoubleML / doubleml-for-py

DoubleML - Double Machine Learning in Python
https://docs.doubleml.org
BSD 3-Clause "New" or "Revised" License
464 stars 70 forks source link

Python 401(k) Case Study flex model specification possible issue with .DoubleMLData object #104

Closed gzhelev2020 closed 3 years ago

gzhelev2020 commented 3 years ago

Hello,

in the Python 401(k) Case Study when entering the flexible model data into the dml.DoubleMLData object and then printing it, the y-variable (net_tfa) is seen in the x_cols even after it was specified as y_col.

This leads to the lasso model on the flex specification not estimating the coefficient correctly. For some reason it is only an issue with the flex model, but not with the base model. This is a reacent issue. Last week it was working properly.

See screenshot dml_flex_issue

MalteKurz commented 3 years ago
gzhelev2020 commented 3 years ago

that was the issie, thanks.