Allow GithubStream to continue on specific errors

ericboucher commented 2 years ago

As I was implementing a GithubStream for community_profile, I stumbled upon the following issue:

Some repos actually return a 404 when we request their community_profile. It is quite lucky that this got caught early thanks to the main test repo being one of these edge-cases. Though I still do not understand why.

https://api.github.com/repos/facebook/react/community/profile works...
https://api.github.com/repos/MeltanoLabs/target-athena/community/profile doesn't work 🤯 -> 404

As a workaround, we could allow tolerated_http_errors and handle them differently. This is what I have implemented here. But I am open to other ideas 😄

Open question:

Is it worth implementing at the SDK level? This seems a bit specific to GitHub but 🤷

ericboucher commented 2 years ago

Linked issue on the SDK - Error Handling - https://gitlab.com/meltano/sdk/-/issues/134

aaronsteers commented 2 years ago

@ericboucher - For the SDK generic handling, I'm thinking a multi-facet approach along these lines:

Add a public and overridable method in RESTStream called raise_if_error() (or similar), which would abstract the case logic in _request_with_backoff(). By default, the raise_if_error() would check status codes just as we do today, and then raise one of a specific set of exception types to be handled by _request_with_backoff() and the SDK internals.
Predefined exception types would be documented in the dev guide, to be optionally sent by custom implementations, and then caught and handled by the SDK.
We probably would want to introduce both "Fatal" and "Retryable" exception classes, and also add exception classes to signify rate limits.
- Rate limit exceptions could similarly be categorized as retryable or fatal, depending on period (per-minute limits being retryable and per-day limits not, for instance).
The set of status codes that roll up into each exception type should also be overridable by the developer.
- The example here uses tolerated_http_errors. We could support a RESTStream.tolerated_http_errors property or perhaps just allow users to override fatal_http_errors to exclude the error code in question.
- If the user overrides the numeric code list (RESTStream.tolerated_http_errors or RESTStream.fatal_http_errors, for instance), then they may or may not actually need to also override raise_if_error(). Presumably, they'd only need to do both when the status code is not sufficient information or when the response needs to be parsed in order to properly determine the nature of the exception message.
- Potential flaw in this plan is that if we want an entire series of codes to represent a failure, it may require adding all the codes in the sequence, something like range(start=400, stop=500). I'm not sure how much of an issue this would be, but wanted to call it out.

@ericboucher - What do you think of this approach? It should give you less you would need to override in cases such as these.

cc @edgarrmondragon

MeltanoLabs / tap-github

Allow GithubStream to continue on specific errors #16