cagov / caldata-mdsa-caltrans-pems

CalData's MDSA project with Caltrans on Performance Measurement System (PeMS) data
https://cagov.github.io/caldata-mdsa-caltrans-pems/
MIT License
7 stars 0 forks source link

Aggregate raw 30-second data to five minutes #117

Closed ian-r-rose closed 7 months ago

ian-r-rose commented 7 months ago

Many downstream metrics (e.g., speed) are based on five-minute aggregations of the raw VDS data. Ideally, this would include imputations, but for the time being, we want to aggregate the un-imputed raw data to 5 minutes to unblock further metric development.

AzamBeg-Caltrans commented 7 months ago

30-sec flows should be summed up over 5 mins, lets say into 'Lane_1_Flow_5m'. 30-sec occupancies should be averaged over 5 mins, lets say into 'Lane_1_Occ_5m'.
Then: Lane_1_Speed_5m = Lane_1_Flow_5m / Lane_1_Flow_5m / 600

ian-r-rose commented 7 months ago

@AzamBeg-Caltrans that's what is happening in #118. I'm also averaging the speeds as reported by the clearinghouse data, but I don't know how reliable they are, so we should feel free to ignore them in favor of your computations

kengodleskidot commented 7 months ago

@ian-r-rose I will work on the speed calculation to recreate what I believe @AzamBeg-Caltrans is looking for. Do you want me to create a new branch to work on the speed calculation or should I make changes in the 5-minute-aggregates branch?

ian-r-rose commented 7 months ago

Let's do it in a new branch @kengodleskidot