I ran into the issue after marking two sleds non_provisionable (more context in #5872). There were a number of TCP connections established by iperf3 running between instances on different sleds during that time. I noticed a large number of retransmits related to instances running on the sleds marked non_provisionable. I peeked into the opte tcp layer entries and saw that all the TCP connections to the impacted instances were in the SYNC_RCVD state:
The TCP connections were removed after some duration of inactivity and not left behind indefinitely. As such, feel free to close this as duplicate of #5872 if there is no other concern.
I ran into the issue after marking two sleds non_provisionable (more context in #5872). There were a number of TCP connections established by iperf3 running between instances on different sleds during that time. I noticed a large number of retransmits related to instances running on the sleds marked non_provisionable. I peeked into the opte tcp layer entries and saw that all the TCP connections to the impacted instances were in the
SYNC_RCVD
state:Stopping/starting the instances or putting sleds back to provisionable didn't allow these entries to be unstuck or cleared.