socrata / opendatanetwork.com

The Open Data Network
https://www.opendatanetwork.com/
Other
19 stars 12 forks source link

Jobs Proximity Info Statement incomplete #206

Closed zang0 closed 8 years ago

zang0 commented 8 years ago

is:

The jobs proximity index quantifies access to employment oppurtunities in a region. Values are percentile ranked and range from 0 to 100, with higher values corresponding to better access to jobs. Data is available for U.S. counties and is current as of .

note:

should be a date @ the end.

zang0 commented 8 years ago

Also, can we describe this more explicitly? Aren't we taking the mean and median across CBSAs inside of the counties? That should be stated. Also, lets incorporate this info below as well as I think it makes the data more coherent.

I took a crack at it below. Is this accurate?

The jobs proximity index quantifies access to employment opportunities in a region. Values are percentile ranked and range from 0 to 100, with higher values corresponding to better access to jobs. Data is computed for U.S. counties by applying summary statistics across all CBSA regions present in a county and is current as of 2015. [more]

+more leads to this additional text:

The underlying index quantifies the accessibility of a given residential neighborhood as a function of its distance to all job locations within a CBSA, with distance to larger employment centers weighted more heavily. Specifically, a gravity model is used, where the accessibility (Ai) of a given residential block-group is a summary description of the distance to all job locations, with the distance from any single job location positively weighted by the size of employment (job opportunities) at that location and inversely weighted by the labor supply (competition) to that location.

aaasen commented 8 years ago

I don't think this is quite right. This makes it seem like there are multiple CBSAs in one county but it is usually the other way around. Instead of including the methodology on the page, could we make the link to the source more clear? Right now if a source includes the sourceURL property it will be shown in the attribution at the bottom. Just put one in for this dataset but it isn't on staging yet. See environment tab for an example.

zang0 commented 8 years ago

I don't get it. Where do the mean and median numbers come from? I thought we generated this mapping from CBSA to county? + @malindac

malindac commented 8 years ago

The original data was provided by census tract only so there were multiple rows per county. I mentioned in the meeting last Tuesday that values within the same county varied greatly. Since we couldn't map to the census tract, Deep suggested that we calculate the mean, median and standard deviation for each county. So I calculated the mean and median by county.

zang0 commented 8 years ago

So is this text below accurate @malindac ?

The jobs proximity index quantifies access to employment opportunities in a region. Values are percentile ranked and range from 0 to 100, with higher values corresponding to better access to jobs. Data is computed for U.S. counties by applying summary statistics across all census tracts present in a county and is current as of 2015. [more]

+more leads to this additional text:

The underlying index quantifies the accessibility of a given residential neighborhood as a function of its distance to all job locations within a census tract, with distance to larger employment centers weighted more heavily. Specifically, a gravity model is used, where the accessibility (Ai) of a given residential block-group is a summary description of the distance to all job locations, with the distance from any single job location positively weighted by the size of employment (job opportunities) at that location and inversely weighted by the labor supply (competition) to that location.