NeighborhoodInfoDC / Vital

Vital statistics (births & deaths)
0 stars 0 forks source link

New Deaths Summary Program #24

Closed rpitingolo closed 6 years ago

rpitingolo commented 6 years ago

@maggiesu0725 can you please set up a program similar to Births_sum_all_address.sas for the death data? Thanks!

ptatian commented 6 years ago

@rpitingolo @maggiesu0725 Please include the 1998 - 2002 death data in the summary files, too. Even though it's older data, we should still use it (unless there's some reason not to).

Similar for the birth summaries. Can we please update those programs as well? Thank you.

rpitingolo commented 6 years ago

@ptatian yes that's the plan. The older data is summarized using %Create_all_summary_from_tracts so I am trying to figure out how to run that and the new process and piece the final files together. @maggiesu0725

ptatian commented 6 years ago

@rpitingolo Ah, yes. I forgot that. Makes sense to tackle that later, then. Thanks!

rpitingolo commented 6 years ago

@ptatian in theory it's not hard, the trouble is that %Create_all_summary_from_tracts requires a file called [name]_tr10 as the input, so I think we basically need to save a legacy version of deaths_sum_tr10 to feed into the macro. But I don't want to call it deaths_sum_tr10 because that would get confusing with all of the other deathssum[geo] variables that have newer data included.

ptatian commented 6 years ago

@rpitingolo I was just reminded by a question that @maggiesu0725 asked me that we should apply the suppr5f. format to the variables in the births and deaths summary data sets, as we do with the TANF and food stamp files. Our agreement with DOH requires us to suppress cells where counts are less than 5. This format is already in the General library and so does not need to be created.

rpitingolo commented 6 years ago

@ptatian thanks, so that should be applied to the base data sets (and as a result the summary datasets)?

rpitingolo commented 6 years ago

@ptatian sorry, just the summary files. Obviously the suppression will depend on the geography in questino.

ptatian commented 6 years ago

@rpitingolo Yes, just the summary files.

rpitingolo commented 6 years ago

@ptatian Hi Peter. @maggiesu0725 pointed out that suppr5f. doesn't suppress zeros (if data = 0 it keeps it as 0). However it sounds like for IRB we are supposed to suppress everything less than 5 which I would interpret as including zero. Do you have a take on this?

ptatian commented 6 years ago

Zero means no one is there so you aren’t revealing any information about anyone.

lhendey commented 6 years ago

@rpitingolo @maggiesu0725 - can this issue be closed?

rpitingolo commented 6 years ago

@lhendey yes