
bug: github-search rate limiting #409

Open pnadolny13 opened 1 year ago

pnadolny13 commented 1 year ago

I added PRs and issues streams to the github search EL and we're starting to hit the rate limits, so the Airflow job won't complete. We do a search for taps and targets, and every result gets a bunch of follow-on requests for PRs/Issues/Readme content, so I think our result set is too big to ever complete within the rate limit ranges. Even with incremental loads we still hit every repo every time for updates, and if we have 5k repos then our 5k request limit is used up quickly.

Some options I can think of:

  1. Limit search query further
  2. Add more auth tokens
  3. Split EL jobs somehow
  4. Increase Airflow's retry time to 1 hr
  5. Add a configurable feature to tap-github, like throttle_requests, to stay within the rate limit. If the limit is reached it should sleep instead of hard failing.

Challenges with each:

  1. I can't figure out how to do this. Github's search seems to be very inexact. For example, our non-fork target search criteria brings back https://github.com/andabi/deep-voice-conversion as a top result and includes many taps. I'm only retrieving non-forks for now until this issue is resolved.
  2. The tap accepts a list of auth tokens, which would help, but it's a user-level rate limit so we'd need auth tokens from multiple accounts. I don't know how we'd do that and manage them.
  3. This seems like a hack: we'd expect it to keep failing every run and rely on Airflow's retries to eventually let it finish.
  4. Same as above.
  5. It would work, but then there's wasted compute just hanging around waiting for the rate limit to reset, so that's not ideal. (Rough sketch of the idea below.)
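
For illustration, the throttle_requests idea in option 5 could look roughly like this, using GitHub's rate limit response headers. This is just a sketch, not tap-github's actual code, and the function name is made up:

```python
import time

import requests


def get_with_rate_limit_wait(session: requests.Session, url: str, **kwargs) -> requests.Response:
    """Sketch of the throttle_requests idea: sleep through a rate limit reset
    instead of hard failing. Not tap-github code; the name is hypothetical."""
    resp = session.get(url, **kwargs)
    remaining = int(resp.headers.get("X-RateLimit-Remaining", "1"))
    if resp.status_code == 403 and remaining == 0:
        # GitHub reports when the current window resets as a unix timestamp.
        reset_at = int(resp.headers.get("X-RateLimit-Reset", str(int(time.time()) + 3600)))
        wait_seconds = max(reset_at - int(time.time()), 0) + 1
        time.sleep(wait_seconds)  # this is the "wasted compute" part
        resp = session.get(url, **kwargs)  # retry once after the window resets
    return resp
```

The downside called out above is exactly that time.sleep: the worker can sit idle for close to an hour waiting for the window to roll over.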

@aaronsteers any thoughts on this since you've worked with this tap before and run into similar problems?

aaronsteers commented 1 year ago

@pnadolny13 - We're working with hourly rate limits, correct?

Can we split the streams and queries that we need to run, so that they run in alternating hours?
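
Something like this in meltano.yml, for example. The plugin/stream names, loader, and cron expressions below are just placeholders to show the idea: two inherited extractors selecting different streams, scheduled on alternating hours so each run gets its own rate limit window:

```yaml
plugins:
  extractors:
    # Hypothetical split: one extractor for the repo/search streams,
    # one for the heavier PR/issue follow-on streams.
    - name: tap-github-search-repos
      inherit_from: tap-github
      select:
        - repositories.*
    - name: tap-github-search-activity
      inherit_from: tap-github
      select:
        - issues.*
        - pull_requests.*

schedules:
  - name: github-search-repos
    extractor: tap-github-search-repos
    loader: target-snowflake      # whichever loader the project already uses
    transform: skip
    interval: "0 */2 * * *"       # even hours
  - name: github-search-activity
    extractor: tap-github-search-activity
    loader: target-snowflake
    transform: skip
    interval: "0 1-23/2 * * *"    # odd hours
```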

I think that's the first thing I'd consider, and then (if we think this can finish within 2-3 hours' limits) I'd lean towards trying to add sleep/wait like the throttle_requests approach. The compute cost for 3 hours of execution should be pretty minimal, and is on par with other larger data sources at some companies.

Neither is a silver bullet and I think there may be other good options as well. Maybe a good topic for Data Office Hours?