Data4Democracy / drug-spending

Project to understand pharmaceutical spending, currently focused on US government programs.

Investigate new data source: Medical Expenditure Panel Survey #22

Closed mrklees closed 6 years ago

mrklees commented 7 years ago

Task

The Agency for Healthcare Research and Quality collects data annually that is used to make nationally representative estimates of health care use, expenditures, sources of payment, and health insurance coverage. Investigate the website housing all of this data, meps.ahrq.gov, and find any data files that include details on prescription drug spending.

How this will help

The data available on the site could help us understand the trends in prescription drug spending over the last decade. It could help add context to our analysis as well, as the data is not limited to Medicaid and Medicare recipients.

What we want to get out of this

The data we're interested in is a subset of what is available, so a first task might be to compile a list of names and descriptions of the data sets of interest. Some of the data sets have nearly 2000 features, so it would also be very helpful if the fields of interest were identified and clearly documented for each data set.

jenniferthompson commented 7 years ago

Awesome - thanks @mrklees!

mrklees commented 7 years ago

I have begun to document the data available on meps.ahrq.gov in this Google Sheet.

Update: If anyone is looking for a way to collaborate on this, a good way would be to open up the Google Sheet and then navigate to the documentation of the 2014 Full Year consolidated data file. Then dig through it and try to find any fields that might be interesting for future analysis. There are 1839 features, so there's plenty to consider.
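
For anyone digging through that many fields, a keyword search over the field names and descriptions can give a first shortlist to review by hand. The sketch below is only an illustration: the codebook layout (a CSV with `name` and `description` columns), the field names, and the keyword list are all assumptions, not the actual MEPS documentation format.

```python
import csv
import io

# Hypothetical keywords; adjust to whatever counts as "drug spending".
KEYWORDS = ("prescri", "rx", "medicine", "drug")


def shortlist_fields(codebook_rows, keywords=KEYWORDS):
    """Return (name, description) pairs whose text mentions any keyword."""
    hits = []
    for row in codebook_rows:
        text = f"{row['name']} {row['description']}".lower()
        if any(keyword in text for keyword in keywords):
            hits.append((row["name"], row["description"]))
    return hits


# Tiny made-up codebook standing in for the real documentation.
SAMPLE = """name,description
FIELD_A,Total prescription medicine expenditures
FIELD_B,Age at end of year
FIELD_C,Number of office-based visits
FIELD_D,Amount paid out of pocket for drugs
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
for name, description in shortlist_fields(rows):
    print(name, "-", description)
```

Substring matching is deliberately loose here, so the shortlist will still need a manual pass to drop false positives.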

mtelpoukhovskaia commented 7 years ago

@mrklees, I see there are 43 fields of interest that you've identified. Are you currently looking for more, or working on tidying the data? I can help out on this issue.

jenniferthompson commented 7 years ago

@mtelpoukhovskaia I'm not aware of any further work that's been done since @mrklees's post, so if you'd like to run with this that would be great!

mrklees commented 7 years ago

Hi @mtelpoukhovskaia, sorry for being unresponsive. The most interesting data I have found through MEPS are the Prescribed Medicine Files (link in the gsheet). They track patient spending and consumption of individual drugs, along with a ton of demographic markers and other fields. I have been working towards combining all the data from 1996 to 2014 into a single data file, but I have had a lot of trouble cleaning the data. Here are a couple of things that would be super helpful:

Tonight I will upload my work so far to my fork of the repo so that you can see what I've already done.
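
For anyone picking this up, here is a rough sketch of the kind of combining step described above. It is not the code in the fork. It assumes each year is a CSV file, that some column names embed the two-digit survey year (the `PAID96X` / `PAID14X` names here are made up), and that columns missing in a given year can be left blank.

```python
import csv
import io
import re


def normalize(column, year):
    """Drop the two-digit year from a column name, e.g. PAID14X -> PAIDX."""
    yy = f"{year % 100:02d}"
    # Strip the year only when it follows letters and ends the name,
    # optionally followed by a trailing X.
    return re.sub(rf"^([A-Z]+){yy}(X?)$", r"\1\2", column)


def combine(files_by_year):
    """Stack rows from every year, adding a YEAR column."""
    combined, columns = [], ["YEAR"]
    for year, handle in sorted(files_by_year.items()):
        for row in csv.DictReader(handle):
            clean = {"YEAR": str(year)}
            for key, value in row.items():
                name = normalize(key, year)
                clean[name] = value
                if name not in columns:
                    columns.append(name)
            combined.append(clean)
    # Fields that a given year lacks become empty strings.
    return columns, [{c: r.get(c, "") for c in columns} for r in combined]


# Made-up two-year example; real files would be opened from disk instead.
files = {
    1996: io.StringIO("ID,DRUG,PAID96X\n1,aspirin,4.50\n"),
    2014: io.StringIO("ID,DRUG,PAID14X,REGION\n7,ibuprofen,6.25,2\n"),
}
columns, rows = combine(files)
print(columns)
for row in rows:
    print(row)
```

Keeping the year as its own column means the year-specific names collapse into one field, which makes trends across years easier to query later.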

mattgawarecki commented 7 years ago

@mrklees @mtelpoukhovskaia Checking in to see how work on this issue is going. If either of you has run into any obstacles, we'll do what we can to help. 👍

darya-akimova commented 6 years ago

While the MEPS survey is uploaded on data.world, I'm not sure how it can be used at the current stage of the project. I will close this issue for now and open a new one if there seems to be a use for it in the future.