vega / vega-datasets

Common repository for example datasets used by Vega-related projects

docs: add description and source information for jobs.json #593

Closed: dsmedia closed this 2 months ago

dsmedia commented 2 months ago

To start with what we can be confident about: the data is almost certainly derived from US Census Bureau data, though historical data this specific isn't generally available directly on census.gov. It was also very likely aggregated by a domain expert from raw IPUMS USA data. But to your question, the immediate source of jobs.json remains a bit of a mystery. I haven't been able to find these exact data points referenced anywhere else (e.g. in a widely cited academic paper). The file was originally uploaded by @arvind, and the only documentation I can find is the one line in this example. I've also emailed IPUMS to ask.

domoritz commented 2 months ago

Sometimes, if you look at which example uses this dataset, you can find a corresponding D3 example that lists an author and a source.

dsmedia commented 2 months ago

Updated with original source (vintage 2006!), permission from IPUMS, and additional context and links.