ExpediaGroup / apiary-federation

Terraform scripts for deploying Apiary Data Lake federation
https://github.com/ExpediaGroup/apiary
Apache License 2.0
5 stars 10 forks source link

Override keepAlive time to be lower then NLB idle time (350s) #130

Closed patduin closed 10 months ago

patduin commented 10 months ago

See for instance this blogpost: https://paramount.tech/blog/2021/07/26/mitigation-of-connection-reset-in-aws.html and https://medium.com/tenable-techblog/lessons-from-aws-nlb-timeouts-5028a8f65dda We've seen a reduced number of 10 mins (connection timeout) reported by waggle dance once we set this lower TCP keepalive settings. Going from 30-40 timeout calls an hour to single digit. Occurrences still happen we suspect because the server side (HMS) should also set similar TCP KeepAlive settings.