nickeubank / mtv_viacom_capstone

1 stars 0 forks source link

THE thing we're most interested in #94

Open nickeubank opened 2 years ago

nickeubank commented 2 years ago

Hey All,

For meeting Monday, the thing that Adriane and I are most interested in seeing I think are clear articulations of the exact population underlying all of the statistics we use. I think you are all doing a terrific job of measuring distances using different metrics, in the speed with which you have picked up the nuances of geospatial analysis are terrific! But I think the thing that we don't have a really good grip on is the exact population (sample) for which we are calculating our statistics

In particular:

Colleges

I understand we're using the 2,500 colleges and universities that we were able to merge between the HIFLD and the +1 demographic data, but what population does that correspond to? In our press release, when we say "only 30% of {define sample} colleges and universities have a polling place within a mile of campus", what goes in the {define sample} spot? It isn't ALL colleges and universities in the US, so what is it?

HIFLD as a public dataset and so it's fine to refer to that, but nobody else has this +1 spreadsheet, so we have to be able to define the population of colleges and universities that were included in the data. #84

States

We also want to make sure we're clear on the states we're including. For example, for early voting, we're using the Ballot Ready data (Yay!), and particularly the (number?) states with "meaningful" early voting (meaning we drop [list of states] that have no early voting, and [list of states] that only allow early voting at the government office.

For election day, we're using CPI for [list of states], and [other source? for which states?] for our other states.

jgy4 commented 2 years ago

Hi all, looking forward to meeting tomorrow!

In regards to defining the 'Colleges' Sample do you think it would be sufficient to say "Campuses With Demographic Data Compiled by The Students Learn Students Vote Coalition from IPEDS (The Integrated Post-Secondary Education System) & OPE (Office of Post-Secondary Education)"?

Additionally, in regards to states I've compiled a list of state-by-state early voting laws/sources here:

Based on this we intend to exclude Alabama, Hawaii, New Hampshire, North Dakota, Rhode Island, South Carolina, Vermont, Washington, & Wyoming from our early voting statistics.

Looking forward to discussing this more!

nickeubank commented 2 years ago

"Campuses With Demographic Data Compiled by The Students Learn Students Vote Coalition from IPEDS (The Integrated Post-Secondary Education System) & OPE (Office of Post-Secondary Education)"?

I think we need to reach out to them and ask them what criteria they were using to include colleges in their spreadsheet. ie is this everyone in IPEDS? What we really want to be able to do is give the reader information about how the population we are studying relates to all the actual colleges and universities in the US.

nickeubank commented 2 years ago

(Comments on spreadsheet soon)

nickeubank commented 2 years ago

Based on this we intend to exclude Alabama, Hawaii, New Hampshire, North Dakota, Rhode Island, South Carolina, Vermont, Washington, & Wyoming from our early voting statistics.

I think your list seems terrific! You could also e-mail the +1 team to ask about the couple that you're on the fence about when you e-mail them to ask where they got their list of 3,000 colleges and universities.

Just to expand on my message from a few hours ago, what we need to do is be able to articulate how our sample population relates to the universe of colleges and universities. For example, we need to be able to say that we aren't starting from a list from the +1 team of colleges and universities with poor access to early voting! At this point because we haven't gotten an explicit explanation of how that list was created, we can't even say that.

We will also want to get some basic summary statistics about how the 2,500 colleges and universities that merged with the homeland security database differ from the five hundred that didn't.

jgy4 commented 2 years ago

Ah okay this makes sense! I'll definitely send that email to ask for clarification on both.

dapoade commented 2 years ago

Hi @nickeubank and @adrianefresh .

Here is the link to google documents where we added a number of statistics for your review, in preparation to give them to Vaughan.

https://docs.google.com/document/d/1Vgeg74FrPzWwxI8VJyz4hmbRACehChJiaIa8WboYHV0/edit

adrianefresh commented 2 years ago

Wonderful, thank you. We're reviewing them!

Adriane

On Sun, Jan 9, 2022 at 11:37 PM dapoade @.***> wrote:

Hi @nickeubank https://github.com/nickeubank and @adrianefresh https://github.com/adrianefresh .

Here is the link to google documents where we added a number of statistics for your review, in preparation to give them to Vaughan.

https://docs.google.com/document/d/1Vgeg74FrPzWwxI8VJyz4hmbRACehChJiaIa8WboYHV0/edit

— Reply to this email directly, view it on GitHub https://github.com/nickeubank/mtv_viacom_capstone/issues/94#issuecomment-1008539406, or unsubscribe https://github.com/notifications/unsubscribe-auth/AEXM52PZ3SD3VFDO6RFESETUVJO7TANCNFSM5LNKKXIA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you were mentioned.Message ID: @.***>