apollographql / router

A configurable, high-performance routing runtime for Apollo Federation 🚀
https://www.apollographql.com/docs/router/
Other
775 stars 251 forks source link

Hot reload downtime under load #5185

Open utay opened 1 month ago

utay commented 1 month ago

Describe the bug

In production a router process serves hundreds of requests per second; when it hot reloads because the supergraph has been refreshed, a few requests get 502s from the load balancer because it got connection refused on the router.

To Reproduce

I've been able to reproduce pretty consistently:

Expected behavior

I'd expect 0 downtime as per the docs.

Additional context

Geal commented 1 month ago

hi, thank you for the report. This looks like an issue we have seen elsewhere, we'll investigate and get back to you

abernix commented 4 weeks ago

This might be related to https://github.com/apollographql/router/pull/5235, which should land reasonably soon. If you wanted to try with that PR included, that might be a worthwhile try. 😄