stuartong / streetsofnyc

MADS Milestone 1 - Sheila/Moutaz/Stuart
0 stars 1 forks source link

Datasets #5

Open stuartong opened 3 years ago

stuartong commented 3 years ago

For highlighting and discussing all alternative datasets we can consider:

Main:

  1. NYC Tickets
  2. Uber
  3. Yelp

Issues: Yelp: public dataset is not NYC - @mgendia working on it Uber: spotty and missing some months

Alternatives proposed:

  1. NYC Collision data - @sheilavp see if there is correlation between models in terms of accident and tickets
  2. New York Taxi Data - might be an alternative to Uber? Haven't explored it but heard on office hours it's quite a rich data set
sheilavp commented 3 years ago

New York Taxi data, I saw it before. We can pick both yellow and green taxi (yellow is mostly Manhattan island, and green is mostly Brooklyn if I remember correctly) OR pick just one taxi colour, but it is definitely another huge data.....

stuartong commented 3 years ago

Problem is taxis have been around for ages so can't see effect in violations - instead see effect of uber on taxi rides?

stuartong commented 3 years ago

Also maybe we are finding questions in the data when we should put ourselves in the shoes of maybe NYC city officials or something to that effect to ask more interesting questions that are relevant

stuartong commented 3 years ago