Open geertn opened 1 month ago
Thanks for letting know.. We are tracking the issue here https://github.com/Azure/azure-container-networking/issues/2703 and will be working on a fix. I will keep this thread updated
I see the fix is merged, thank you for that. Will this be included in the upcoming node image release?
Describe the bug The process
azure-vnet-telemetry
seems to be leaking sockets. Errors are also present in the logs. Every error line corresponds to a socket leak. They happen on multiple nodes in the AKS cluster but not all. The issue has been present for weeks and caused a production outage because of too many open sockets.To Reproduce
See if the node produces errors like this:
Look at open sockets:
Most of the times the error and amount of open sockets correspond
Expected behavior No errors and no leaking sockets.
Environment (please complete the following information):
Additional context
logfile.txt
Open support issue 2403170050000665