codecov / engineering-team

This is a general repo to use with GH Projects

Write reference/runbook for codecov-api "sync vs. async" root cause of bugs #2473

Open suejung-sentry opened 1 month ago

suejung-sentry commented 1 month ago

This task captures an investigation into cases where confusion between sync and async code has been the root cause of bugs in codecov-api.

Definition of Done: a technical reference explaining why and when this error occurs, so developers in codecov-api can watch out for the "trap" in their own code and when reviewing others' PRs. It would also be nice to have an on-call runbook with tips for debugging, for recognizing when this is the root cause of an issue, and for fixing it (hot-fix or long-term fix).

If the investigation produces ideas for a more systemic solution (a linter? a type checker?), document those as well so they can be picked up in future work.
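For illustration only: this thread does not say which form the trap takes in codecov-api, so the sketch below assumes one generic Python form, a coroutine called without `await`. The names `fetch_owner`, `resolver_buggy`, and `resolver_fixed` are hypothetical and not taken from the codebase.

```python
import asyncio
import inspect


async def fetch_owner() -> str:
    """Hypothetical async lookup, standing in for real I/O."""
    await asyncio.sleep(0)
    return "codecov"


async def resolver_buggy() -> bool:
    # Trap: without `await`, `owner` is a coroutine object, not a string.
    owner = fetch_owner()
    got_coroutine = inspect.iscoroutine(owner)
    # Close it so this demo does not emit a "never awaited" RuntimeWarning.
    owner.close()
    return got_coroutine


async def resolver_fixed() -> str:
    # Fix: await the coroutine to get its result.
    return await fetch_owner()


if __name__ == "__main__":
    print(asyncio.run(resolver_buggy()))
    print(asyncio.run(resolver_fixed()))
```

A mistake of this shape is the sort of thing a static check might plausibly flag, though whether that applies to the bugs seen in codecov-api is still to be confirmed by the investigation.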

Past examples:

Some other old instances I saw in Slack:

Here's a more recent one that Michelle fixed for us: https://github.com/codecov/codecov-api/pull/794 (my guess is that it cost the team at least a dozen developer-hours to diagnose and fix)

suejung-sentry commented 1 week ago

Update: some learnings to date (to be collated once the investigation is done)

More details TBD as we learn more about this