hasadna / open-bus

:bus: Analysing Israel's public transport data

siri data should reflect changes in OriginAimedDepartureTime / siri_ride__scheduled_start_time #390

Open OriHoch opened 2 years ago

OriHoch commented 2 years ago

reproduction steps

expected

actual

OriHoch commented 2 years ago

The OriginAimedDepartureTime for that ride changed. In snapshot 2022/04/03/04/54 it was 07:00 (Israel) and in later snapshots it changed to 06:54 (Israel).

Our code takes the first OriginAimedDepartureTime it encounters for a ride and applies it to all later vehicle locations of that ride. We don't keep track of changes in OriginAimedDepartureTime.
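To make the behavior concrete, here is a minimal sketch, not the project's actual code. The record layout, the `ride_key` field, and the function name are hypothetical; only `OriginAimedDepartureTime` and the two times come from this issue.

```python
# Hypothetical, simplified vehicle-location records in snapshot order.
# Times are Israel local time as quoted in this issue.
snapshots = [
    {"ride_key": "ride-1", "OriginAimedDepartureTime": "07:00"},
    {"ride_key": "ride-1", "OriginAimedDepartureTime": "06:54"},
    {"ride_key": "ride-1", "OriginAimedDepartureTime": "06:54"},
]


def assign_first_seen_start_time(records):
    """Attach to each record the first OriginAimedDepartureTime seen for its ride."""
    first_seen = {}
    result = []
    for record in records:
        # setdefault stores the value only the first time a ride is seen,
        # so later changes in the source are silently ignored.
        start = first_seen.setdefault(
            record["ride_key"], record["OriginAimedDepartureTime"]
        )
        result.append({**record, "scheduled_start_time": start})
    return result


assigned = assign_first_seen_start_time(snapshots)

# Records whose source value no longer matches the stored start time.
changed = [
    r for r in assigned
    if r["OriginAimedDepartureTime"] != r["scheduled_start_time"]
]
```

In this sketch every record ends up with a `scheduled_start_time` of 07:00, while `changed` collects the records where the source value had already moved to 06:54.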

OriHoch commented 2 years ago

@EyalBerger wrote:

I think that from an analytical point of view this ride should be recorded as two rides: one with an OriginAimedDepartureTime of 07:00 and one of 06:54, for two reasons:

We want to allow users to load SIRI data as it is in the original records. Operators could make those changes for different purposes, and if we want to explore their nature and patterns we need the data to be identical to the SIRI source files.
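One way to picture this proposal is a rough sketch under stated assumptions, not a design for the actual DB. The record layout, the `ride_key` and `snapshot` fields, and the function name are hypothetical. The idea is to key rides on the scheduled time as well, so a change in OriginAimedDepartureTime produces a separate ride instead of being overwritten.

```python
from collections import defaultdict

# Hypothetical, simplified vehicle-location records in snapshot order.
locations = [
    {"ride_key": "ride-1", "OriginAimedDepartureTime": "07:00", "snapshot": "s1"},
    {"ride_key": "ride-1", "OriginAimedDepartureTime": "06:54", "snapshot": "s2"},
    {"ride_key": "ride-1", "OriginAimedDepartureTime": "06:54", "snapshot": "s3"},
]


def group_by_ride_and_start_time(records):
    """Group records so each distinct OriginAimedDepartureTime forms its own ride."""
    rides = defaultdict(list)
    for record in records:
        key = (record["ride_key"], record["OriginAimedDepartureTime"])
        rides[key].append(record)
    return dict(rides)


rides = group_by_ride_and_start_time(locations)
```

Here `rides` has two entries, `("ride-1", "07:00")` and `("ride-1", "06:54")`, each holding the vehicle locations reported with that value.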

OriHoch commented 2 years ago

This greatly increases the complexity of our processes and DB structure, so we will keep it in the backlog for now.

Users who want to see all the details can always download the source SIRI data. Another option is to add an API method that makes that process easier.

EyalBerger commented 2 years ago

Thanks for the summary. I think it's important to reflect the source. I suggest discussing it with the team at the next internal meeting to get a better understanding of how to prioritize this, taking the complexity into account.