cal-itp / data-infra

Cal-ITP data infrastructure
https://docs.calitp.org/data-infra
GNU Affero General Public License v3.0

NTD Scraping – Remaining Tables in Dataset #3402

Closed charlie-costanzo closed 2 months ago

charlie-costanzo commented 3 months ago

User story / feature request

Part of #3401

As a data engineer, I would like to scrape the remaining tables in the NTD dataset in a similar fashion, extending the work in scrape_ntd.py and annual_database_service.yml (found below).

This builds upon the work completed in #3345.
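
A minimal sketch of the kind of per-table extraction step this story asks for, assuming each data product page exposes its workbook as an .xlsx link. This is not the code in scrape_ntd.py, which is not reproduced here; the names XlsxLinkParser and find_xlsx_links, the sample HTML, and the file path are all hypothetical and only illustrate the idea of locating a download link on a page.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class XlsxLinkParser(HTMLParser):
    """Collect href values of <a> tags that point at .xlsx files."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Ignore any query string when checking the file extension.
        if href and href.lower().split("?")[0].endswith(".xlsx"):
            self.links.append(href)


def find_xlsx_links(page_html, base_url):
    """Return absolute URLs for every .xlsx link found in page_html."""
    parser = XlsxLinkParser()
    parser.feed(page_html)
    return [urljoin(base_url, href) for href in parser.links]


# Hypothetical page fragment, not copied from the real site.
SAMPLE_HTML = """
<div>
  <a href="/about">About</a>
  <a href="/sites/example/files/2022-service.xlsx">2022 Service</a>
</div>
"""

links = find_xlsx_links(
    SAMPLE_HTML, "https://www.transit.dot.gov/ntd/data-product/"
)
print(links)
```

In a real task the page HTML would come from an HTTP request, and the page URL for each table could be supplied by a config file, so adding a table would mean adding a config entry and no new code.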

Existing NTD patterns:

General Cal-ITP Pipeline Patterns

Acceptance Criteria

I can successfully extract the data for the following tables from the https://www.transit.dot.gov/ntd/data-product website:

Notes