pzivich / zEpid

Epidemiology analysis package
http://zepid.readthedocs.org
MIT License
141 stars 33 forks source link

TMLE additions #39

Closed pzivich closed 5 years ago

pzivich commented 5 years ago

This is a longer term project. As I am reading through Targeted Learning, I will add to this list regarding features I would like to add. Also important notes that I have gleamed from the book.

pzivich commented 5 years ago

Just a side thought related to this. Might consider calculating ALL available measures when TMLE is fit. This would avoid the issue of having to re-specify the model each time (which could be time-intensive for complex ML). Might be better to dump all effect measures to the user (since computation time is small for computing all measures)

pzivich commented 5 years ago

Missing data process based on R tmle:

1) Estimate missing data model (something like missing_model() optional argument)

2) Multiply g1W and g0W by corresponding p(missing=0)

3) Missing needs to factor into influence curve calculation. ONLY the indicator though, not the weight itself

Help for missing data: https://www.jstatsoft.org/article/view/v051i13

pzivich commented 5 years ago

Don't plan on adding C-TMLE anytime soon. The problem it is used to solve can be fixed via cross-fitting (to my current understanding. if that changes, will consider adding). C-TMLE purports to give the correct answer when both g- and Q-models are incorrect. I think this is an artifact of the Kang-Schafer data

As for now, these remaining additions are on the back-burner. If anything changes or users request, I will re-open