Closed evansiroky closed 2 years ago
Describe the bug
It appears that the gtfs_downloader.validate_gcs bucket sometimes fails to upload or maybe download data which fails the DAG task and a whole bunch of other tasks after that.
gtfs_downloader.validate_gcs
To Reproduce
See logs:
2022-02-09 try 1:
2022-02-11 00:51:36,296] {pod_launcher.py:159} INFO - aiohttp.client_exceptions.ClientResponseError: 503, message='Service Unavailable', url=URL('https://www.googleapis.com/download/storage/v1/b/gtfs-data/o/schedule%2F2022-02-10T00:00:00+00:00%2F238_0%2Fcalendar_dates.txt?alt=media')) [20
2022-02-09 try 2:
[2022-02-11 02:16:23,313] {pod_launcher.py:159} INFO - aiohttp.client_exceptions.ClientOSError: [Errno 32] Broken pipe
2022-02-12 try 1:
[2022-02-14 02:04:44,679] {pod_launcher.py:159} INFO - aiohttp.client_exceptions.ClientOSError: [Errno 32] Broken pipe
2022-02-12 try 2:
[2022-02-14 04:38:01,195] {pod_launcher.py:159} INFO - aiohttp.client_exceptions.ClientOSError: [Errno 32] Broken pipe
Expected behavior
The script should be able to gracefully handle file upload/download exceptions with automatic retries.
Working on this in https://github.com/cal-itp/gtfs-validator-api but blocked pending permissions on the repo
https://github.com/cal-itp/gtfs-validator-api/pull/3 is open
Describe the bug
It appears that the
gtfs_downloader.validate_gcs
bucket sometimes fails to upload or maybe download data which fails the DAG task and a whole bunch of other tasks after that.To Reproduce
See logs:
2022-02-09 try 1:
2022-02-09 try 2:
2022-02-12 try 1:
2022-02-12 try 2:
Expected behavior
The script should be able to gracefully handle file upload/download exceptions with automatic retries.