cal-itp / data-infra

Cal-ITP data infrastructure
https://docs.calitp.org/data-infra
GNU Affero General Public License v3.0
48 stars 14 forks source link

Research: percent of providers with a static GTFS dataset. #984

Closed evansiroky closed 1 year ago

evansiroky commented 2 years ago

User Story

As a Cal-ITP program manager or more senior Caltrans executive, I want to know what percent of transit agencies have a GTFS Schedule dataset so that I can quantify the statewide quality of transit data across all agencies and track improvement to this quality over time.

This question should build on build on top of the research performed in cal-itp/data-analyses#379 and calculate a percent of relevant transit agencies that have GTFS Schedule data.

If needed, research could be performed with various stakeholders to determine how to calculate the answer to this question since some transit agencies could have a GTFS Schedule for some of their services but not others. Therefore, there is a decision that needs to be made whether this high level ask is asking for whether some agencies are at least partially accounted for or fully accounted for with GTFS Schedule data. If none of the stakeholders can give a clear answer about how to calculate this baseline, a deliverable of this report should propse at least one recommended option for calculating this baseline.

Acceptance Criteria

Given the data Cal-ITP has collected about transit agencies with respect to how they are funded, what kind of service they operate, and the presence of GTFS Schedule data When applying all relevant criteria about what qualifies as a transit agency for reporting purposes and analyzing for the presence of GTFS Schedule data Then a percentage should be calculated.

The deliverable of this should be as a metabase question that simply shows a number as a percent.

Sprint Ready Checklist

evansiroky commented 1 year ago

Answered via work on GTFS Guidelines Dashboard.