neondatabase / neon

Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
https://neon.tech
Apache License 2.0
14.51k stars 422 forks source link

Configure proxy -> compute keepalives #4306

Open arssher opened 1 year ago

arssher commented 1 year ago

User set suspend_timeout to 20s after which endpoint frequently restarted and at the same time became unavailable for the deployed app for 15-30 minutes. Seems like proxy -> compute is one more place where we miss timeouts. However there is no KAs in main postgres protocol, so probably we just need to configure TCP keepalives/timeouts on the kernel level.

copied from https://neondb.slack.com/archives/C04DGM6SMTM/p1684338654994679

stradig commented 7 months ago

We are not sure about the right solution. We need to investigate this. Estimate size is for the investigation only.