Handshake with [x.x.x.x:xxxx] failed after session was established! Alert Protocol Level: FATAL Description: HANDSHAKE_FAILURE

eclipse-californium / californium

CoAP/DTLS Java Implementation

https://www.eclipse.org/californium/


Handshake with [x.x.x.x:xxxx] failed after session was established! Alert Protocol Level: FATAL Description: HANDSHAKE_FAILURE #2200

Closed jvermillard closed 10 months ago

jvermillard commented 11 months ago

This issue occurs sporadically on some production servers running Cf 3.9.1:

Handshake with [x.x.x.x:xxxx] failed after session was established! Alert Protocol Level: FATAL Description: HANDSHAKE_FAILURE

This doesn't make sense to me because if the session was established, then we had a successful handshake, but the reported alert is handshake failure.

It's difficult to capture, since it's only a few errors among a lot of traffic, and the faulty client's IP address appears to reconnect gracefully later.

boaks commented 11 months ago

ESTABLISHED is reached with the FINISHED message of the other side. The server side sends its FINISHED as the response (normal handshake). To stay prepared for receiving a retransmitted FINISHED, the server stays in ESTABLISHED until it receives the first application data or a timeout occurs (the accumulated timeout of the assumed retransmissions). On timeout, the handshake is simply completed. I'm not sure what causes the ALERT; I would guess some records received from the client. At least, I don't see where Cf fails a handshake after ESTABLISHED on its own.
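
The server-side behaviour described in this comment can be pictured as a small state machine. The following is a minimal sketch, not Californium's actual code: the class `ServerHandshakeSketch`, its states, and its method names are hypothetical, and resending the server's last flight on a retransmitted client message is an assumption based on general DTLS retransmission behaviour.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; none of these names exist in Californium.
final class ServerHandshakeSketch {

    enum State { HANDSHAKING, ESTABLISHED, COMPLETED }

    private State state = State.HANDSHAKING;
    // Records what the server would send, so the behaviour can be inspected.
    private final List<String> sent = new ArrayList<>();

    State state() {
        return state;
    }

    List<String> sent() {
        return sent;
    }

    // Called when the client's final handshake message arrives.
    void onClientFinished() {
        if (state == State.HANDSHAKING) {
            // First arrival: the session is established and the server answers.
            state = State.ESTABLISHED;
            sent.add("server FINISHED");
        } else if (state == State.ESTABLISHED) {
            // Retransmission: the server's answer was probably lost, so resend it (assumption).
            sent.add("server FINISHED (resent)");
        }
        // In COMPLETED the handshake is over; nothing is resent in this sketch.
    }

    // The first application data proves the client received the server's answer.
    void onApplicationData() {
        if (state == State.ESTABLISHED) {
            state = State.COMPLETED;
        }
    }

    // The accumulated retransmission timeout expired without further client messages.
    void onRetransmissionTimeout() {
        if (state == State.ESTABLISHED) {
            state = State.COMPLETED;
        }
    }

    public static void main(String[] args) {
        ServerHandshakeSketch server = new ServerHandshakeSketch();
        server.onClientFinished();
        server.onClientFinished();
        server.onRetransmissionTimeout();
        System.out.println(server.state() + " " + server.sent());
    }
}
```

In this model, anything that fails between the first `onClientFinished()` and the transition to `COMPLETED` would be reported as a failure "after session was established", which is the window the log message refers to.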

boaks commented 11 months ago

I found that log-message in the DTLSConnector lines 748 and 3039. Do you know the line number of your logging message?

boaks commented 11 months ago

I guess adding the HandshakeException message to the log message should provide some more hints about the root cause. I will do that next week, before the 3.10 release ;-).
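
As an illustration of that idea, here is a minimal sketch of a log line that carries the exception message along with the alert. The helper `HandshakeLogSketch.describeFailure` and the `Cause:` suffix are hypothetical; the real change in Californium may format the message differently.

```java
// Hypothetical helper; not part of Californium.
final class HandshakeLogSketch {

    // Builds the log line quoted in this issue and appends the
    // exception message when one is available.
    static String describeFailure(String peer, String level, String description, Exception cause) {
        StringBuilder line = new StringBuilder()
                .append("Handshake with [").append(peer)
                .append("] failed after session was established! Alert Protocol Level: ")
                .append(level)
                .append(" Description: ").append(description);
        if (cause != null && cause.getMessage() != null) {
            // The " Cause: " suffix is an assumed format, chosen for illustration.
            line.append(" Cause: ").append(cause.getMessage());
        }
        return line.toString();
    }

    public static void main(String[] args) {
        Exception cause = new IllegalStateException("example cause text");
        System.out.println(describeFailure("x.x.x.x:xxxx", "FATAL", "HANDSHAKE_FAILURE", cause));
    }
}
```

Without a cause, the helper produces exactly the line reported above, so existing log filters that match on that text would keep working under this assumed format.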

jvermillard commented 11 months ago

> I found that log-message in the DTLSConnector lines 748 and 3039. Do you know the line number of your logging message?

I don't; I found only those two lines. I wonder why line 768 doesn't log the HandshakeException.

BTW, the DTLS client is some version of mbedtls.

boaks commented 11 months ago

OK, I guess adding some more information to the log will bring this a step forward.

boaks commented 11 months ago

> I wonder why line 768 doesn't log the HandshakeException

It's always too little or too much information, or in the wrong place. Usually, when I try to debug something, I add some more information to the log to see the cause, and then I usually remove it again ;-).

jvermillard commented 11 months ago

https://github.com/eclipse-californium/californium/pull/2201

boaks commented 11 months ago

I found two places where, in my opinion, a HANDSHAKE_FAILURE may be caused after the session is already ESTABLISHED:

- unexpected message type
- handshake message left

For the second, I added some more details to see what's left. See PR #2202
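
The two conditions named in this comment can be sketched as a single check. This is an illustration under stated assumptions, not the code from the PR: `PostEstablishedCheckSketch`, its parameters, and the reason strings are hypothetical, and "handshake message left" is read here as handshake messages still pending when the handshake should be complete.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical illustration; the names and reason strings are not Californium's.
final class PostEstablishedCheckSketch {

    // Returns a reason for a HANDSHAKE_FAILURE, or empty when nothing is wrong.
    static Optional<String> failureReason(String receivedType,
                                          List<String> expectedTypes,
                                          List<String> pendingHandshakeMessages) {
        if (!expectedTypes.contains(receivedType)) {
            return Optional.of("unexpected message type: " + receivedType);
        }
        if (!pendingHandshakeMessages.isEmpty()) {
            // Listing what is left is the kind of extra detail that helps find the root cause.
            return Optional.of("handshake message left: "
                    + String.join(", ", pendingHandshakeMessages));
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        List<String> expected = Arrays.asList("FINISHED", "APPLICATION_DATA");
        System.out.println(failureReason("CLIENT_HELLO", expected, Arrays.asList()));
        System.out.println(failureReason("FINISHED", expected, Arrays.asList("CERTIFICATE_VERIFY")));
        System.out.println(failureReason("FINISHED", expected, Arrays.asList()));
    }
}
```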