r-causal / causal-inference-in-R

Causal Inference in R: A book!
https://www.r-causal.org/

Select data set(s) #9

Closed tgerke closed 3 years ago

tgerke commented 3 years ago
  1. Whole game example
  2. Thread to continue through propensity-based models
  3. Survival data: EHR data set. Right heart catheterization (Lucy: https://hbiostat.org/data/repo/rhc.html)
  4. Mediation (might work with data set 2.)

To explore:

- Tidy Tuesday may have candidates
- Our World in Data
- Sports data(?). Sabermetrics is available, but baseball may be too US-centric.
- Econ! Andrew Heiss has examples?
- Education, employment
- Psych data for mediation
- Wellness/happiness (general community survey? personality studies)

A simulation exercise would be useful for demonstrating the "correct" answer and how close the estimates come to recovering the truth. It would be easier if these were not medical.
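A rough sketch of what such a simulation exercise could look like, using base R only. Everything here is an assumption made for illustration: the variable names (`z`, `x`, `y`), the coefficients, the sample size, and the true effect of 2 are not taken from any data set discussed in this thread. The idea is to generate a confounded exposure with a known effect, then compare a naive estimate with an inverse-probability-weighted one.

```r
# Simulate a confounded exposure with a known effect, then compare estimators.
# All values below are hypothetical and chosen only for illustration.
set.seed(2021)
n <- 10000
true_effect <- 2

# z confounds both the exposure x and the outcome y
z <- rnorm(n)
x <- rbinom(n, size = 1, prob = plogis(0.8 * z))
y <- true_effect * x + 1.5 * z + rnorm(n)

# Naive estimate: ignores z, so it absorbs the confounding
naive <- unname(coef(lm(y ~ x))["x"])

# Propensity score model, then ATE inverse probability weights
ps <- fitted(glm(x ~ z, family = binomial()))
w <- x / ps + (1 - x) / (1 - ps)
ipw <- unname(coef(lm(y ~ x, weights = w))["x"])

# Side-by-side comparison against the known truth
round(c(truth = true_effect, naive = naive, ipw = ipw), 2)
```

Because the data-generating process is known, the weighted estimate can be checked directly against `true_effect`, while the naive estimate shows how far off an unadjusted comparison lands.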

tgerke commented 3 years ago

For EHR data, the Synthea app may be a good option. You can build out custom state transition modules that simulate EHR data according to a distribution we assign. This could be great for checking model validity against known parameters.

LucyMcGowan commented 3 years ago

We can pull in Disney data from here: https://touringplans.com/walt-disney-world/crowd-calendar#DataSets. I'll put this in an R package.

LucyMcGowan commented 3 years ago

Disney data is here: www.github.com/LucyMcGowan/touringplans

devtools::install_github("LucyMcGowan/touringplans")