theonaunheim / surgeo

Open Source Proxy Demographic module written in Python
MIT License
32 stars 16 forks source link

Returns multiple records for some ZCTAs #10

Closed chriszs closed 4 years ago

chriszs commented 4 years ago

Appears to be those that cross state lines. Examples: 69201, 51360, 59270

theonaunheim commented 4 years ago

Good call out--thanks.

I'm adding the following line to fetch_geography.ipynb ...

df = df.groupby(df.index).apply(sum)

... which should sum the files on an index (zcta) basis prior to processing. Will run overnight and see what shakes out.

theonaunheim commented 4 years ago

@chriszs When you have a moment would you please update to version 1.0.2, check whether this works on your machine, and close this out if fixed?

Thanks again for catching this--much appreciated.

chriszs commented 4 years ago

Looks good! Thank you!