Open skinkie opened 1 month ago
When processing data we know from GTFS that (Agency, Route) > Trips would guarantee partitions of data that can be processed in parallel. This also means that there is no overlap between the ServiceJourneyPatterns it could or would generate. Obviously the pattern may be the same, but since the RouteRef in NeTEx must point to a dIffert route for a different Line (if you wouldn't use Mentz software, that would require LineRef on ServiceJourney level) it has be unique.
Hence given the above the oppertunity is to keep a hash list of created ServiceJourneyPatterns in memory per (Agency, Route), as being a shortcut of the source standard.
I am not so sure. We have routes that are served by multiple agencies.
but they are rare, so it might work.
I am not so sure. We have routes that are served by multiple agencies.
From GTFS standpoint this is not possible. A route in GTFS has a single agency_id. So in GTFS it is either multiple (the same) routes, or introducing the combined agency.
I would not like to introduce new agencies. I think if we in such cases have two Pattern instead of one, then we still have reduced the number of Pattern a lot and the rest is ok as duplicate.
From GTFS -> NeTEx we can do what we want with ServiceJourneyPatterns, since the concept does not exist in GTFS. My aim would be that we have a shortcut from GTFS to NeTEx to (be able to) directly infer TimeDemandTypes and ServiceJourneyPatterns since doing it directly from GTFS would significantly reduce the operational cost. That having said it is obviously an extra importing step, so we must give the user the option to:
@skinkie a bit more description?