Open SashaWeinstein opened 2 years ago
AD feedback on output
total_pop-pct-denom
field.total_pop-anh-pct-denom
and other race denominator fields for total pop. Let's discuss how to best record denominators.Hey sorry about the 6 columns in the by puma, I thought I ran the export with all data but I only ran with test data. I will re-run now and then we will have the full columns. Will work through other feedback while that is running
2000 data is from Erica, we are holding off on that
Putting Te's comment here
I am still unsure about the naming convention. In this work race, moe, cv suffix is appended with - and in the wiki it seems that all should be appended with _
.
This was a good comment he made on the export PUMS pull request. The reason I'm not addressing it yet is because of our plan changing math first and worry about column labels second. it makes sense to merge in this export PUMS pull request first as to not create merge conflicts in the aggregation code. We will address this once math is correct
Ok so race crosstabs denominators seem to be correct for indicators but not correct for the total_pop columns. I think we can just take out the total pop crosstabbed by race? We only use that information to calculate percentages and it should only be reported in the relevant denominator columns. We will get correct numbers on race from the decennial census
March 10 review
Review of 1519_borough file
_count
from field names (i.e. lep should just be lep
and not lep_count
same with fb. fb
and not fb_count
)age_median
and not age_hsp_median
)age_p5pl
I thought we said no age_p5pl, because denominator columns stay as is
I thought if something had a specific variable we'd include the field. If that last piece is a heavy life then leave all denominators as is.
sorry I guess I remembered wrong. I'm not sure if I understand "The Race/ethnicity abbreviation and stats suffix should always follow the root field name (i.e. age_median and not age_hsp_median)" What's the issue with age_hsp_median? It's the same as lep_wnh_count or pop_hsp_pct as far as I can see?
Yeah they all need to be updated to follow the convention that we've laid out and have estaibaled with other field names.
Ok so all race crosstab column labels for all processes we have are in the wrong order. Is this something we've talked about before? I don't remember talking about it in standup, and it's not in the long comment. I don't think I can make those changes very quickly
Let's discuss what this would entail after I'm off the phone with Lynn
ok
Goal: output 9 tables containing all data points for the 4 indicators that fall into the demographics category. Note that data comes from Census (all of Race and Hispanic origin comes from Census) or it comes from ACS PUMS data.
Final output will include 9 tables; 1 for each geography level and 1 for each timeframe. Each table should have the same number of columns.
Create new folder in edm-publishing
db-eddt>demographics>demographics_2000_puma.csv
to place output files.Each datapoint will have the following:
Consult data dictionary and data matrix field specifications tab with detailed information on exactly what to pull for each indicator.