SatcherInstitute / health-equity-tracker

Health Equity Tracker is a free-to-use data visualization platform that is enabling new insights into the impact of COVID-19 and other social and political determinants of health on historically underrepresented groups in the United States.
https://healthequitytracker.org/
MIT License
17 stars 24 forks source link

Manually Spot-Check Maternal Mortality Data #2891

Closed benhammondmusic closed 5 months ago

benhammondmusic commented 8 months ago

Manually Spot-Check Data

do #2890 first. It is essential to remember, our automated tests can only help confirm that the CODE itself is functioning the way we expected. It does NOT ensure scientific or mathematic accuracy. Before writing extensive tests, we need to manually confirm that our calculations / data transformations are accurate. The following checks should be documented in the description of a PR

benhammondmusic commented 5 months ago

SPOT CHECK

Current Year, National, Hispanic, Per 100k

Screenshot 2024-04-24 at 11 08 15 AM

2001, Alabama, Hispanic, Per 100k

Screenshot 2024-04-24 at 11 19 44 AM

1999, National, Black NH, pct_share of MM

In BigQuery pct_rel_inequity = +206.7%

pct_rel_inequity = (pct_share of condition - pct_share of population) / pct_share of population

pct_share of condition = a race groups count of condition / total count of condition all races

black mm count = 186 all mm count = 505

black pct_share_mm = 186/505 = 36.8%

black lb count = 593,200 all lb count = 3,965,200

black pct_share_pop (lb) = 593,200 / 3,965,200 = 15.0%

(36.8 - 15) / 15 = +145.3% higher than expected (not of expected)

Need to do #3192 to more accurately calculate our pct_rel inequity figures