nickeubank / mtv_viacom_capstone

1 stars 0 forks source link

Safegraph-Only Polling Places #56

Closed jgy4 closed 2 years ago

jgy4 commented 2 years ago

Hi @nickeubank and @adrianefresh,

I pulled all of the polling places that are showing as "unique" to Safegraph in NC into this file SafeGraph_Only_NC.csv.

My sneaking suspicion is that though a lot of these aren't within a couple hundred meters of the CPI polling places according to their coordinates - a combination of address/name/zip code matching would take care of many of these. I think I'll keep working on exactly how many of these polling places, for example, have an exact address match in CPI.

Anyhow, this file contains 68 polling places in NC, and I know that SOME of them don't seem to show up in CPI. Excited to learn if they are legitimate polling places!

nickeubank commented 2 years ago

Hmmm... yeah, I'm finding matches for a lot of these.

e.g. first safegraph is at

36.3254507 | -77.022931 | [{'name': 'Voting Information Project',   'official': True}] | ROANOKE CHOWAN COMMUNITY COLLEGE | 109 Community College Road | Ahoskie

And I'm seeing it in the polling data I pull directly from PublicCI as:

ROANOKE CHOWAN   COMMUNITY COLLE | 109 COMMUNITY   COLLEGE RD, AHOSKIE, NC 27910 | scraper | 10/25/20 |   | POINT (-77.02293089999999 36.3254507) | POINT (-77.02540999999999 36.32356) | POINT (-77.019828 36.324871) | POINT (-77.01949999999999 36.324) | POINT (-77.02540999999999 36.32356)

Row 2 is:

35.1138386  -77.8188131 [{'name': 'Voting Information Project', 'official': True}]  ALBERTSON RURITAN CLUB  3710 N NC 111 AND   Albertson   NC  28508

And I see in PublicCI at:

11/3/20 NC  DUPLIN  DUPLIN  county      ALBERTSON   29  election_day    ALBERTSON RURITAN CLUB  3710 NC 111 AND 903 HWY, ALBERTSON, NC 28508        scraper 10/25/20        POINT (-77.8185582 35.0565845)  POINT (-77.81841 35.11007)      POINT (-77.81833 35.11411)  POINT (-77.81841 35.11007)

Would you mind just cross-referencing those two records and seeing if they're just landing outside 100m, or if the PublicCI record is getting dropped somewhere.