jsoma / data-studio-projects


Which NY City Precinct Has the Most 2018 Homeless Shelter Arrests So Far? #160

Open zle2105 opened 6 years ago

zle2105 commented 6 years ago

Pitch

The NYPD posted a small database of all the homeless shelter arrests from 1/1/18 to 3/1/18. I am going to compare the locations of the NYPD Precincts making the arrests to the distribution of homeless shelters in the city to determine whether the arrests are distributed equally or if there are outliers.

Summary

So far, I have information on whether these arrests are for felonies, misdemeanors, or violations. I will be comparing the distribution of this data between the precincts as well.

My inspiration is this chart from FiveThirtyEight, which depicts annual fluctuations in homicides. While I will not be graphing an event over time, I do like how it portrays three different aspects of a crime stat in one image. My only concern is that it may look too busy.

image

Details

Right now, I am looking for NYPD precinct statistics to supplement my research with.

Possible headline(s): (See Above) Which NY City Precinct Has the Most 2018 Homeless Shelter Arrests So Far?

Data set(s): https://www1.nyc.gov/site/nypd/stats/reports-analysis/homeless-shelter-arrests.page

Code repository:

Possible problems/fears/questions: I am worried that the data set may be too small and give an exaggerated result. I would like to find comparable stats from either NYC or another similar city. It may be easier for me to compare arrests in a three-month period from a previous year as opposed to extrapolating these stats over a year-long period.

Work so far

Compiling a DataFrame from the CSV data.

Checklist

zle2105 commented 6 years ago

I completed my data sets and have created a few preliminary charts.

My main concerns now are:

  1. How to honestly portray the missing Precinct data on the charts?

  2. How to compare the results to the number of shelters in each Borough?

  3. How to do both without cluttering the charts with annotations?
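For the first concern, one possible approach is to keep the missing precincts in the table as explicit gaps instead of letting them silently disappear or show up as zero. Below is a minimal pandas sketch of that idea. The precinct names and counts are made-up placeholders, not values from the NYPD data, and the `reported` column name is just an assumption for illustration.

```python
import pandas as pd

# Placeholder counts, NOT real NYPD figures. "Precinct B" is missing on purpose.
counts = pd.Series({"Precinct A": 12, "Precinct C": 7}, name="arrests")

# Hypothetical full list of precincts that should appear on the chart.
all_precincts = ["Precinct A", "Precinct B", "Precinct C"]

# reindex keeps every precinct; the ones without data become NaN, not 0.
table = counts.reindex(all_precincts).to_frame()

# Flag which rows actually had data so the chart can mark the gaps
# (for example with a hatched bar or a "no data" label).
table["reported"] = table["arrests"].notna()

print(table)
```

Keeping the gap as NaN rather than 0 avoids implying that a precinct had no arrests when the data is simply absent.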

The First Graph is for the Total Number of Arrests for Each Borough

image

The Second Graph is for the Total Number of Felonies for Each Borough

image

The Third Graph is for the Total Number of Misdemeanors for Each Borough

image

The Fourth Graph is for the Total Number of Violations for Each Borough

image

Lastly, I have a graph for the Top Five Precincts for Arrests (I am going to label each one with its Borough Name). The Precinct name is on the Y-Axis (this label needs to be clearer).

image

playfairbot commented 6 years ago

Greetings! I'm a little robot, checking in on your project.

Please post your first revision! It should be posted by Thursday at midnight. More details available here.

If this is your update, make sure you followed the update template so my robot eyes can see it. Click the pencil in the top right-hand corner to edit/update that comment.

You need some feedback, let me summon @Katerinavts, @Palarisk, @nickospi for you

It looks like we need to fix up your pitch a little bit! Edit it by clicking the pencil in the top right-hand corner. It requires:

Palarisk commented 6 years ago

Hi, really interesting story idea! Your question number 2: How to compare the results to the amount of shelters in each Borough?

Do you have data on the number of people living in these shelters? If so, you could compare the results as an arrest/people ratio. If you don't have that, you could also calculate the arrest/shelter ratio, which would still be meaningful info, e.g., for people living nearby.
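A rough sketch of how both ratios could be computed in pandas, assuming you end up with one table of arrests per borough and one table of shelter counts and shelter residents per borough. All column names and numbers here are hypothetical placeholders, not real figures.

```python
import pandas as pd

# Placeholder numbers, NOT real NYPD or shelter figures.
arrests = pd.DataFrame({
    "borough": ["Bronx", "Brooklyn", "Manhattan"],
    "arrests": [30, 40, 50],
})
shelters = pd.DataFrame({
    "borough": ["Bronx", "Brooklyn", "Manhattan"],
    "shelters": [10, 20, 25],
    "residents": [1500, 2000, 5000],
})

# Join the two tables on borough; a left join keeps every borough with arrests.
merged = arrests.merge(shelters, on="borough", how="left")

# Arrest/shelter ratio and arrest/people ratio (scaled per 100 residents).
merged["arrests_per_shelter"] = merged["arrests"] / merged["shelters"]
merged["arrests_per_100_residents"] = merged["arrests"] / merged["residents"] * 100

print(merged)
```

The same pattern should work at the precinct level if the shelter data can be matched to precincts.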

nickospi commented 6 years ago

dz2383 commented 6 years ago

Interesting topic! Here are my suggestions:

sarahslo commented 6 years ago

so good that you have taken these steps and charted the data. thing is here, part of what we are seeing is a reflection of the population. that's why the charts look the same. these data need to be divided by the number of people per borough.

it's like saying china has a lot of people who ride bicycles but hawaii has fewer... so the next step is to normalize it by population.
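to make that concrete, here is a small sketch of the normalization step. the area names and numbers are invented placeholders (not real populations or arrest counts), and `rate_per` is just a hypothetical helper name.

```python
import pandas as pd


def rate_per(counts: pd.Series, population: pd.Series, per: int = 100_000) -> pd.Series:
    """Divide counts by population (aligned on the index) and scale to `per` people."""
    return counts.div(population) * per


# Placeholder values, NOT real data: area A is ten times bigger than area B.
arrests = pd.Series({"A": 50, "B": 10})
population = pd.Series({"A": 1_000_000, "B": 100_000})

rates = rate_per(arrests, population)
print(rates.sort_values(ascending=False))
```

in this made-up example area A has more arrests in raw numbers, but area B has the higher rate once you divide by population, which is the kind of difference the raw charts can hide.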