The code that reads raw GTFS data into dataframes doesn't specify an explicit data type for service_id, so the type has to be inferred. In schedules with a mix of numeric and alphanumeric service ids, this can cause some to be read as numbers and some as strings, which prevents joins from working properly. This change makes service_id values always be read from CSV as strings.
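As a rough, self-contained sketch of the mechanism (this is not the project's reader code; the CSV contents and the `read_table` helper are made up for illustration), pandas infers an integer column when every service_id in a table happens to be numeric, and a string column when any value is alphanumeric. Passing an explicit `dtype` for `service_id` makes both tables agree:

```python
import io

import pandas as pd

# Synthetic stand-ins for two GTFS tables (hypothetical data).
CALENDAR_CSV = "service_id,monday\n1,1\n2,1\n"
TRIPS_CSV = "trip_id,service_id\nt1,1\nt2,2\nt3,WKDY\n"


def read_table(text, dtype=None):
    """Hypothetical helper: parse CSV text, optionally forcing column dtypes."""
    return pd.read_csv(io.StringIO(text), dtype=dtype)


# With inference, the all-numeric column becomes integers while the
# mixed column stays as strings, so the join keys no longer line up.
calendar_inferred = read_table(CALENDAR_CSV)
trips_inferred = read_table(TRIPS_CSV)
print(calendar_inferred["service_id"].dtype, trips_inferred["service_id"].dtype)

# Forcing service_id to str gives both tables the same key type.
calendar_str = read_table(CALENDAR_CSV, dtype={"service_id": str})
trips_str = read_table(TRIPS_CSV, dtype={"service_id": str})
joined = trips_str.merge(calendar_str, on="service_id", how="inner")
print(joined)
```

With the explicit dtype, the join matches the two trips whose service ids appear in the calendar table, regardless of whether those ids look numeric.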
The following snippet should illustrate the issue for at least the next few weeks: