cal-itp / data-infra

Cal-ITP data infrastructure
https://docs.calitp.org/data-infra
GNU Affero General Public License v3.0
47 stars 12 forks source link

Analysts Can Quickly Query GTFS Schedule Data #447

Closed charlie-costanzo closed 2 years ago

charlie-costanzo commented 2 years ago

Analysts currently have a number of tools available to them to query the data. However, due to a lack of data documentation, they'll likely have a difficult time navigating and understanding the data warehouse.

Problems:

Solutions:

Create issues from:

edasmalchi commented 2 years ago

I think it might help if the docs were clearer on how to find which service date(s) is/are represented in GTFS Schedule Feeds Latest, whether they are consistent between operators, and whether they need additional date filtering for effective analysis.

Also perhaps an example on when it might make sense to use those schedule feeds vs. the fact/dim tables available in warehouse_views.

machow commented 2 years ago

Declaring bankruptcy on this--since this is a issue I put up when @charlie-costanzo was still getting a feel for things. He's stubbing out epics for the parts of the docs that should tackle this (see https://github.com/cal-itp/data-infra/issues/690).