JEP: Websocket token authentication with subprotocols

minrk commented 5 months ago

JEP for #119

SylvainCorlay commented 3 months ago

I guess that the retry pattern will just be a bit more complicated when dealing with the aligned kernel subprotocol?

minrk commented 3 months ago

I guess that the retry pattern will just be a bit more complicated when dealing with the aligned kernel subprotocol?

I'll need to check. In all cases, the first request should include the kernel subprotocol and the token subprotocol, with the kernel subprotocol first indicating highest priority. There will be these possible cases to handle in terms of server support:

supports both:
- first request succeeds, subprotocol set to kernel subprotocol
supports token but not kernel:
- first request succeeds, subprotocol is set to token subprotocol - this indicates kernel subprotocol is not supported
supports kernel subprotocol but not token (this is currently released servers):
- if cookie authentication available, succeeds and kernel subprotocol is set
- if cookie authentication is not available, fails with 403. If this were a normal HTTP request, that would be easy to identify, but websockets obfuscate errors so it may be hard to distinguish from other failures
- second request with kernel subprotocol and token in URL succeeds
supports neither token nor kernel:
- if cookie authentication available, fails with no supported subprotocol
- (if auth failure is indistinguishable, which I believe it is) second request with token in url fails with no supported subprotocol
- third request with token in url and no subprotocol succeeds
- if cookie authentication is not available, fails with 403 (indistinguishable, I think)
- same as above for the rest

Which means the first request reliably determines whether the token can be in the subprotocol. If the first request fails, the token shall be in the URL parameter. The second request then determines kernel subprotocol support (the first request in current jupyterlab), and in the event of a server supporting neither subprotocol, a third and final request is needed to make a successful connection.

So if token auth is supported by the server, retries are no longer needed to check for support of the kernel subprotocol, which is a plus, I suppose. But in order to handle all possible cases, there could be up to 2 retries instead of the current 1, assuming I'm correct that clients can't meaningfully distinguish between a websocket that fails due to missing auth vs unsupported protocols. It could be simplified if the two failures turn out to be distinguishable in the client.

It may have been a good idea to define a subprotocol version string for the older wire format so that servers could explicitly declare that they only support the older wire format. But I'm not sure if that's worth discussing at this point, because that would reduce the number of cases where the retry is needed.

minrk commented 3 months ago

I've run some tests, and browsers don't appear to record the http response (essentially, the WebSocket in browser seems to pretend that websockets are not built on top of HTTP, so expose nothing about the HTTP requests/responses to client code). So clients need to treat all connection failures as indistinguishable:

no supported protocol
auth error
404
500
not a websocket endpoint at all
lost connection

I'm not necessarily proposing we do this, but so folks can have an informed opinion, if the client knows whether the token subprotocol is supported before attempting websocket requests, the conditions look like this:

server supports token protocol:
- first request fails: means auth error, no retry
- first request succeeds:
- selected subprotocol will be kernel if kernel subprotocol supported, token if kernel subprotcol not supported
server doesn't support token protocol (token in url, same as now):
- first request fails: auth error or unsupported protocol
- second request without kernel protocol:
  - succeeds if kernel subprotocol supported
  - fails if it was really an auth error or other problem

So it is simpler, especially since the current subprotocol retries can be eliminated if the token subprocol is known or assumed to be supported. But it adds the preflight specification to somehow communicate that token-authenticated websockets are supported, which we haven't decided on, and don't currently have a mechanism for.

vidartf commented 3 months ago

@minrk is it possible to (ab)use the reason field in the close event? Or will that undermine some of the security considerations of websockets?

minrk commented 3 months ago

is it possible to (ab)use the reason field in the close event?

Yeah, I hadn't thought of that, but we could. I don't think it's abuse, I think it's actually what close code/reason are intended for. So instead of not accepting the connection in the first place, accept the connection and immediately close with a code (e.g. 4403 - 4000 + status code, since unregistered websocket codes should be in 4000-4999). This has an advantage in that it would actually give us a place for communicating the reason for the close, which is helpful, and maybe what we should have been doing all along.

There are backward-compatibility downsides to the transition, at least:

existing clients don't do this, so connection errors must still be handled by clients. But at least we should know that a connection error doesn't mean supporting token protocol + unauthorized token, if servers behave as intended
from a Python perspective, it may be complicated to properly accept and immediately close connections without triggering unauthorized on_open/get/pre_get side effects, or accidentally accept unauthorized connections because currently open is assumed to be protected to only be called on authorized requests, and with such a change, open would end up being called (my idea here is to patch out self.open = self.open_and_immediately_close and self.get = WebSocketHandler.get, but if overridden close assumes open has been called, errors are likely and hard to avoid.

jupyter / enhancement-proposals

JEP: Websocket token authentication with subprotocols #121