UrbanInstitute / education-data-package-stata

MIT License
19 stars 4 forks source link

Subsetting data while using csv option #60

Open ericatheresa opened 4 years ago

ericatheresa commented 4 years ago

I think this has come up before, but I couldn't find it. When there is a single CSV associated with multiple endpoints, as is the case for CCD enrollment or EDFacts assessments (where each endpoint has the same variables but is subset for different subpopulations), using the csv option with the stata command leads to unexpected behavior. For example, if I ask for

. educationdata using "school ccd enrollment", sub(year=2014 fips=11 grade=8) clear csv

I expect to get no breakdowns for race, sex, etc, but instead I get all of those breakdowns. Is there a way to tell the stata package to do this subsetting?

This is also the case for district enrollment, school & district assessments, and possibly other endpoints as well.

VivianSihanZHENG commented 4 years ago

hi @ericatheresa thanks for opening this issue - This is exactly what I will be working on - adding filter variables to the csv options in the STATA package.