dpzhang / Project_AHDA

CAPP30123 Course Project
0 stars 2 forks source link

Helper Variables -- #2

Closed dpzhang closed 7 years ago

dpzhang commented 7 years ago
  1. pickup regions based on community

  2. dropoff regions based on community

    • 9 regions: Far North Side, Northwest Side, North Side, West Side, Central, South Side, Southwest Side, Far South- west Side, and Far Southeast Side.
    • This grouping scheme was invented by the University of Chicago Social Science Research Committee, officially recognized by the City of Chicago
  3. absolute distance based on pickup and dropoff coordinates

  4. ratio of real path length over shortest path length (RRSL)

    • dividing relative/actual distance by absolute distance
  5. Absolute Velocity: Absolute Distance / Trip Duration

  6. Relative Velocity: Relative Distance / Trip Duration

  7. Ratio of real velocity over relative velocity (RRVV)

  8. Ratio of real path travel time over shortest path travel time (RRST)

    • relative/actual time: in the dataset
    • absolute time: actual_distance / average_velocity
    • interpretation: condition on averaged velocity of the trip, how many more seconds did the driver waste for every second the trip have to take
  9. Time Period: 8 levels

  10. Day: Indication of if weekday, weekend, holiday

    • what about special events (sports game? parade? etc.)
dpzhang commented 7 years ago

Challenge

  1. variable 1 & 2 will need to manually create a dictionary with keys to be community in a region and values to a region

  2. variable 10 how are we dealing with holidays? just ignore it or have to classify holiday? I propose to ignore it due to time constraint... thoughts?