UDST / synthpop

Synthetic populations from census data
BSD 3-Clause "New" or "Revised" License

Extremely large person errors for some rows using non_census_synthesis #77

Open werdnabae opened 1 year ago

werdnabae commented 1 year ago

I am getting extremely large errors when generating a synthetic population with the non_census_synthesis notebook and the sample data (hh_marginals.csv, household_sample.csv, person_marginals.csv, person_sample.csv). The generated households match the household marginals very well, but the generated persons do not match the person marginals well at all.

In this picture, I calculate the percent difference between the synthesized and actual marginals. As you can see, many of the differences are very large.

[image]
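
For reference, here is a minimal sketch of how a percent-difference check like this could be computed with pandas. It is not taken from the notebook: the helper `percent_difference`, the column name `age_group`, and the toy counts are all hypothetical and stand in for whatever categories the person marginals actually use.

```python
import pandas as pd


def percent_difference(synth: pd.Series, target: pd.Series) -> pd.Series:
    """Percent difference of synthesized category counts relative to target marginals.

    `synth` holds one category label per synthesized person (hypothetical layout);
    `target` holds the marginal count for each category, indexed by label.
    """
    # Count synthesized persons per category, aligned to the target's categories.
    synth_counts = synth.value_counts().reindex(target.index, fill_value=0)
    # Categories with a zero target become NaN instead of dividing by zero.
    return (synth_counts - target) / target.where(target != 0) * 100


# Toy data, made up for illustration only.
synth_persons = pd.DataFrame(
    {"age_group": ["0-17", "0-17", "18-64", "18-64", "18-64", "65+"]}
)
target = pd.Series({"0-17": 2, "18-64": 4, "65+": 1})

diff = percent_difference(synth_persons["age_group"], target)
print(diff)
```

A positive value means the synthesis produced more persons in that category than the marginal asks for, and a negative value means fewer.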

I've also tried running the synthesis on my own queried data, and I see the same problem: the person distributions do not match the marginals well.