frictionlessdata / frictionless-py

Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
https://framework.frictionlessdata.io
MIT License
709 stars 148 forks source link

Adding GTFS support #1497

Open shashigharti opened 1 year ago

shashigharti commented 1 year ago

Overview

GTFS(General Transit Feed Specification) is not supported because the csv files are in '.txt' format.

frictionless extract *

As a user, I want to be able to use/apply all the features/functions to GTFS format.

jze commented 1 year ago

Here is an example of a GTFS file: https://opendata.schleswig-holstein.de/dataset/fahrplandaten

The ZIP file contains a set of text files that together make up the timetable data.

Archive:  fahrplandaten.zip
 Length   Method    Size  Cmpr    Date    Time   CRC-32   Name
--------  ------  ------- ---- ---------- ----- --------  ----
    4662  Defl:X     1384  70% 2022-12-10 21:34 ea2c5ebb  agency.txt
  102362  Defl:X     8177  92% 2022-12-10 21:34 26a9089a  calendar.txt
 1357759  Defl:X   219524  84% 2022-12-10 21:34 8081678c  calendar_dates.txt
      64  Defl:X       52  19% 2022-12-10 21:28 342dcac8  frequencies.txt
   48874  Defl:X     8012  84% 2022-12-10 21:33 fecab810  routes.txt
   80363  Defl:X     9325  88% 2022-12-10 21:34 1baf0236  service_alerts.txt
88571777  Defl:X 20901188  76% 2022-12-10 21:33 860988ca  shapes.txt
73613000  Defl:X  8506755  88% 2022-12-10 21:33 aab40266  stop_times.txt
 1522149  Defl:X   362895  76% 2022-12-10 21:34 8f241bc6  stops.txt
  518884  Defl:X    78247  85% 2022-12-10 21:34 8991e51c  transfers.txt
 4034572  Defl:X   385009  91% 2022-12-10 21:33 9ff1ad1c  trips.txt
--------          -------  ---                            -------
169854466         30480568  82%                            11 files