afimb / gtfslib-python

An open source library in python for reading GTFS files and computing various stats and indicators about Public Transport networks
GNU General Public License v3.0
44 stars 6 forks source link

Add stop clustering #32

Closed laurentg closed 8 years ago

laurentg commented 8 years ago

When doing data analysis, one want to see data aggregated by "station" (a restricted geographical zone containing several stops). But some GTFS data do not aggregate stop into stations; or stops comes from various different GTFS. So stop spatial proximity is not always encoded in the data in an easy and accessible way.

laurentg commented 8 years ago

The idea is to add a utility function/class to cluster close stops into a "stop cluster", bypassing the notion of "station". Every set of stop where two stops are closer than a certain limit is bundled together in a cluster.