Open bdecon opened 5 years ago
Should be based on CPS design to the extent possible. For example, take into account the area a person lives in and the size of that area.
http://hamelg.blogspot.com/2015/11/python-for-data-analysis-part-23-point.html
https://www.methodsconsultants.com/tutorial/variance-estimation-for-complex-surveys/
https://stats.stackexchange.com/questions/315336/stratified-random-sampling-when-strata-overlap
https://stackoverflow.com/questions/36997619/sklearn-stratified-sampling-based-on-a-column/36998108
https://newonlinecourses.science.psu.edu/stat506/node/27/
https://stackoverflow.com/questions/2413522/weighted-standard-deviation-in-numpy
Some links above--will need to think about this. Start small, including weights, and then, later, try to work on implementations that take into account the various strata.
Write up an example of calculating standard errors when making a common calculation from the CPS. For example, what is the unemployment rate and what is 95% CI for that rate?