cal-itp / data-infra

Cal-ITP data infrastructure
https://docs.calitp.org/data-infra
GNU Affero General Public License v3.0

User Story: Take account of feed_info when determining service #1484

Open lauriemerrell opened 2 years ago

lauriemerrell commented 2 years ago

User stories

As a GTFS producer, if I include feed_start_date and feed_end_date in feed_info.txt, I expect those to be respected when consumers interpret service dates in calendar.txt and calendar_dates.txt.

As a data analyst or GTFS assessments team member, I want to trust that the service listed in the warehouse is an accurate representation of what the producer intended.

Currently, in the Cal-ITP GTFS schedule data pipeline, service dates are assessed without any reference to feed_info.txt.
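
To make the first user story concrete, here is a minimal sketch of one way to "respect" feed_info.txt: clip a calendar.txt service range to the window given by feed_start_date and feed_end_date. The function names (`parse_gtfs_date`, `clip_service_window`) are hypothetical and not part of the pipeline, and treating service outside the feed window as inactive is an assumption, since the acceptance criteria are not yet defined.

```python
from datetime import date, datetime
from typing import Optional, Tuple


def parse_gtfs_date(value: Optional[str]) -> Optional[date]:
    """Parse a GTFS YYYYMMDD date string; empty or missing values become None."""
    if not value:
        return None
    return datetime.strptime(value, "%Y%m%d").date()


def clip_service_window(
    service_start: date,
    service_end: date,
    feed_start: Optional[date],
    feed_end: Optional[date],
) -> Optional[Tuple[date, date]]:
    """Intersect a calendar.txt date range with the feed_info.txt window.

    A missing feed_start_date or feed_end_date leaves that side unbounded.
    Returns None when the two ranges do not overlap at all.
    """
    start = service_start if feed_start is None else max(service_start, feed_start)
    end = service_end if feed_end is None else min(service_end, feed_end)
    if start > end:
        return None
    return (start, end)


# Illustrative rows only; the values are made up for this sketch.
feed_info = {"feed_start_date": "20220301", "feed_end_date": "20220630"}
calendar_row = {"service_id": "weekday", "start_date": "20220101", "end_date": "20221231"}

window = clip_service_window(
    parse_gtfs_date(calendar_row["start_date"]),
    parse_gtfs_date(calendar_row["end_date"]),
    parse_gtfs_date(feed_info["feed_start_date"]),
    parse_gtfs_date(feed_info["feed_end_date"]),
)
# window == (date(2022, 3, 1), date(2022, 6, 30))
```

The same window check would apply to individual dates from calendar_dates.txt. Whether out-of-window service should be dropped or only flagged is a decision for the acceptance criteria.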

Acceptance Criteria

Notes

This would make the most sense as part of a type 2 refactor that keys everything off of feed version, because these assessments need to be made at the feed level.
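
As a rough illustration of what "at the feed level" could look like once rows are keyed by feed version, the sketch below looks up each service row's feed window by a hypothetical `feed_key` and keeps only dates inside it. The `feed_key` column, the `FeedWindow` type, and the row layout are all assumptions for illustration, not the warehouse schema.

```python
from datetime import date
from typing import Dict, List, NamedTuple, Optional


class FeedWindow(NamedTuple):
    """feed_info.txt window for one feed version; None means unbounded."""
    feed_start: Optional[date]
    feed_end: Optional[date]


def filter_service_dates(
    service_rows: List[dict],
    windows: Dict[str, FeedWindow],
) -> List[dict]:
    """Keep rows whose service_date falls inside their feed version's window.

    Rows whose feed_key has no entry in `windows` (for example, feeds
    without feed_info.txt) are kept unchanged.
    """
    kept = []
    for row in service_rows:
        window = windows.get(row["feed_key"])
        if window is not None:
            if window.feed_start is not None and row["service_date"] < window.feed_start:
                continue
            if window.feed_end is not None and row["service_date"] > window.feed_end:
                continue
        kept.append(row)
    return kept


# Illustrative data only.
windows = {"feed_v1": FeedWindow(date(2022, 3, 1), date(2022, 6, 30))}
rows = [
    {"feed_key": "feed_v1", "service_id": "weekday", "service_date": date(2022, 2, 28)},
    {"feed_key": "feed_v1", "service_id": "weekday", "service_date": date(2022, 3, 1)},
    {"feed_key": "feed_v2", "service_id": "weekday", "service_date": date(2022, 2, 28)},
]
in_window = filter_service_dates(rows, windows)
```

Here the first row is dropped because it precedes feed_v1's start date, while the feed_v2 row is kept because that feed version has no declared window.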

Sprint Ready Checklist

    • [ ] Acceptance criteria defined
    • [ ] Team understands acceptance criteria
    • [ ] Team has defined solution / steps to satisfy acceptance criteria
    • [ ] Acceptance criteria are verifiable / testable
    • [ ] External / 3rd Party dependencies identified

lauriemerrell commented 1 year ago

I think this is still something to consider doing. The handling of feed_timezone in #2457 could provide an example of how to pull some values from certain files and apply them feed-wide.