arup-group / gelato

Gelato turns MATSim outputs into transport sustainability KPI metrics
GNU General Public License v3.0

PT wait time intermediate output - add count of wait events #25

Open val-ismaili opened 8 months ago

val-ismaili commented 8 months ago

The current structure of intermediate-pt-wait-time.csv is:

mode | access_stop_id | hour | Mean [wait_time_seconds]
bus  | 1500MKHALL2    | 9    | 34.750000
bus  | 150G00000876   | 11   | 280.600000
bus  | 1500IM1706A    | 9    | 238.333333
rail | 910GBRTWOOD    | 9    | 205.812500
rail | 910GSHENFLD    | 9    | 326.242424

Can we add a column that logs the count of wait events at each stop in that time period? From a transport perspective, the count is useful context on how significant each mean waiting time is.

It also allows for sense-checking ( / "show your working") of the main kpi-pt-wait-time.csv. Currently, taking the mean of the mean column in the intermediate output will not give the same number, because we would need to weight each row by its count of 'waits'.
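
To illustrate the weighting point, here is a minimal sketch in plain Python. It assumes the main KPI is the mean over all individual wait events, which is not confirmed here. The stop IDs come from the table above, but the wait times, the `count` value, and all function names are hypothetical and are not part of gelato.

```python
from collections import defaultdict

# Hypothetical wait events: (mode, access_stop_id, hour, wait_time_seconds).
# Stop IDs are taken from the table above; the wait times are invented.
events = [
    ("bus", "1500MKHALL2", 9, 30.0),
    ("bus", "1500MKHALL2", 9, 40.0),
    ("bus", "1500MKHALL2", 9, 50.0),
    ("bus", "150G00000876", 11, 280.0),
]


def aggregate(events):
    """Group by (mode, stop, hour) and return {key: (mean_wait, count)}."""
    groups = defaultdict(list)
    for mode, stop, hour, wait in events:
        groups[(mode, stop, hour)].append(wait)
    return {key: (sum(w) / len(w), len(w)) for key, w in groups.items()}


def mean_of_means(rows):
    """Unweighted mean of the per-group means (ignores the counts)."""
    means = [mean for mean, _ in rows.values()]
    return sum(means) / len(means)


def weighted_mean(rows):
    """Mean of the per-group means, weighted by each group's event count."""
    total_wait = sum(mean * count for mean, count in rows.values())
    total_count = sum(count for _, count in rows.values())
    return total_wait / total_count


rows = aggregate(events)
overall = sum(event[3] for event in events) / len(events)

print(mean_of_means(rows))  # differs from the overall mean
print(weighted_mean(rows))  # matches the overall mean
print(overall)
```

With a count column in the intermediate output, the weighted version can be reproduced from the CSV alone, which is what makes the sense-check possible.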

steffenaxer commented 8 months ago

The issue of significance also applies to other Key Performance Indicators (KPIs) that consist of aggregated data. When comparing aggregated values such as averages, the outcomes can often seem unexpected, or even counterintuitive. This may happen when outliers affect the aggregated data in ways that were not anticipated.