aws / aws-parallelcluster

AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.
https://github.com/aws/aws-parallelcluster
Apache License 2.0
818 stars 309 forks source link

Enable QUIC transport protocol for DCV on head node #6143

Open elduds opened 4 months ago

elduds commented 4 months ago

DCV on the head node leverages the default transport protocol configuration of WebSockets over TCP, but it can support QUIC with minor backwards-compatible configuration change.

I enable QUIC by default as my users are up to 300msec from our preferred Parallel Cluster region and experience significant latency, jitter & packet loss, which are far more tolerable using QUIC.

Regardless, given the performance improvements, I think the project should consider enabling QUIC by default, but at the very least make it optional, given that the worst-case outcome is any network or client issues when attempting QUIC simply result in graceful fallback to the existing TCP-based connections.

One complication is that QUIC is only supported using the DCV thick client, not via web browser.

Proposed changes required:

  1. Set enable-quic-frontend=true in /etc/dcv/dcv.conf
  2. Add UDP/ from in the head node security group
  3. Update pcluster dcv-connect to test for & launch the thick client after fetching session credentials
elduds commented 4 months ago

FWIW I'm using

himani2411 commented 2 weeks ago

Hi elduds,

I have added this in our Backlog. Thank you for the proposed changes.