microsoft / showwhy

MIT License
200 stars 28 forks source link

Some thoughts on estimator explainers (WIP) #169

Open robertness opened 2 years ago

robertness commented 2 years ago

Don't mind me. Just scribbling down some suggestions as I familiarize myself with the package.

On the "exposure-based models" image

One way to explain: Models that adjust for many confounders at once using propensity scores. Propensity scores are estimates being assigned to the exposed group. These models calculate propensity scores for each individual, even when they weren't in the exposed group.

On the "outcome-based models" image Perhaps linear regression should be presented as the default choice so as not to encourage people to use the more heavy duty models when a simpler model will work fine and is more defensible?

Relatedly, in the text below... image it says it uses bootstrap CI for "Linear Regression models". Perhaps it should be "Outcome-based models", since that would allow for nonlinear double ML models in the future?

Also the linear regression estimator in DoWhy does not use bootstrap. It uses the statsmodels library, which uses t-distribution-based CI.

GeorgeS2019 commented 1 year ago

@robertness

ShowWhy enables emulation of randomized controlled trials that produce a high standard of real-world evidence.

I could not find any references.

An animated screenshot in a gif showing how the package works would be great!

GeorgeS2019 commented 1 year ago

Found it! Community Workshop on Microsoft's Causal Tools

Causality and Machine Learning

image