gctools-outilsgc / Kubernetes-config

All the Kubernetes manifest files required to deploy whatever we're running.
3 stars 3 forks source link

Sites with their database outside the cluster vnet randomly lose and stop being able to connect / connections time out to database #43

Closed Phanoix closed 1 week ago

Phanoix commented 1 week ago

Leaving this here as both documentation and for anyone else who might have run into this issue and might stumble on to this. In particular this has plagued gccollab for a while, in the end it was all down to SNAT port exhaustion because of the default outbound load balancer rules that assumes you're going to need enough ports for around 50 nodes minimum and connections will need to be allowed to be left idle for a while. This is the part of the docs that goes into detail on the defaults and what can be changed: https://learn.microsoft.com/en-us/azure/aks/load-balancer-standard#configure-the-allocated-outbound-ports.