cypressf closed 3 years ago
Still awaiting health data. Shelli will put the MATLAB to Python transition on hold and work on getting the health data ready for the website. Seb will send CSV files to Shelli, but they need some processing to derive county-level data.
Alyssa calculated the PM2.5 data, and I added it. I worked some with Shelli on mortality data, and I'm awaiting that.
Shelli gave me some mortality data.
The CDC data was a bit of a nightmare. I managed to wrangle it into a tidy state, but I didn't have time to do the estimates. Since it takes extra care to set up a way to distinguish which values are filled in, I've left that out of the code for now so we can get something up on the website. The data is cleaned and ready to go; it just still has NaNs in it. The code for everything I've done is on GitHub, and anyone is welcome to take a stab at it; if not, I can get to it when I get back from break. Put it in the Dropbox! I do want to urge everyone again to take a closer look at the data and at what to do with the suppressed values. For the categories 0-5 and 5-25, almost every single value is suppressed. Doing the recommended calculation (subtracting the sum of the non-suppressed values from the state value to get a percentage for the remaining suppressed ones) would give almost the same result as using the state values, and in many cases exactly the same.
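To make the suppressed-value discussion concrete, here is a rough sketch of the remainder calculation in plain Python. It is not the code on GitHub: the function name `fill_suppressed`, the dict-based inputs, and the choice to split the remainder across suppressed counties in proportion to population are all assumptions made for illustration. It also returns a flag per county so filled-in values can be told apart from reported ones.

```python
import math


def fill_suppressed(county_deaths, county_pop, state_deaths):
    """Estimate suppressed county death counts from the state total.

    Hypothetical helper, not the project's actual code.
    county_deaths: {county: count, or NaN/None if suppressed}
    county_pop:    {county: population}
    state_deaths:  reported state-level total
    Returns (filled, is_estimate), two dicts keyed by county.
    """
    def is_known(value):
        return value is not None and not math.isnan(value)

    known = {c: d for c, d in county_deaths.items() if is_known(d)}
    suppressed = [c for c in county_deaths if c not in known]

    # Deaths not accounted for by the counties that were reported.
    remainder = state_deaths - sum(known.values())
    # Assumption: share the remainder out by population.
    suppressed_pop = sum(county_pop[c] for c in suppressed)

    filled, is_estimate = {}, {}
    for county, deaths in county_deaths.items():
        if county in known:
            filled[county] = deaths
            is_estimate[county] = False
        else:
            filled[county] = remainder * county_pop[county] / suppressed_pop
            is_estimate[county] = True
    return filled, is_estimate


# Made-up numbers, only to show the shape of the inputs.
deaths = {"A": 30.0, "B": float("nan"), "C": float("nan")}
pop = {"A": 1000, "B": 100, "C": 300}
filled, is_estimate = fill_suppressed(deaths, pop, state_deaths=50.0)
```

Under this population-weighted assumption, if every county in a category is suppressed, each county's per-person estimate works out to the state's overall rate, which matches the concern about the 0-5 and 5-25 categories.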
I can't find descriptions, units, or sources in Dropbox or office. I'll write up some temporary ones.
I added death data:
I need it in per-capita form (deaths / person) in order to show a choropleth map, but for now I'm showing the absolute numbers I have as a bubble map.
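For reference, the deaths / person conversion could look something like the sketch below. The function name `deaths_per_person` and the dict inputs are hypothetical, and it assumes county populations are available from some other source; counties with a missing death count or no population stay as NaN so they can be left blank on the map.

```python
import math


def deaths_per_person(county_deaths, county_pop):
    """Convert absolute death counts to deaths per person.

    Hypothetical helper: inputs are {county: value} dicts.
    Missing counts (None/NaN) and missing or zero populations give NaN.
    """
    rates = {}
    for county, deaths in county_deaths.items():
        population = county_pop.get(county)
        missing = deaths is None or math.isnan(deaths)
        if missing or not population:
            rates[county] = float("nan")
        else:
            rates[county] = deaths / population
    return rates


# Made-up numbers, only to show the shape of the inputs.
rates = deaths_per_person(
    {"A": 30.0, "B": float("nan"), "C": 12.0},
    {"A": 1000, "B": 100},
)
```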
This will be in a new "health" tab, which is still awaiting data, descriptions, etc.