Architecture for Intermediaries

bemasc commented 5 months ago

The draft right now is pretty vague about how intermediaries are supposed to work. I think we need to get a lot more specific.

Right now I know of 5 kinds of relevant intermediaries:

TCP load balancers
QUIC load balancers
ECH terminators (ECH Split Mode CFS)
TLS terminators
HTTP gateways (with and without host rewriting)

Right now, my best idea is to employ the following rules:

Each participating entity in the chain (including the backend) is presumed to be represented by its own HTTP origin. This origin represents the full path to that entity, including all preceding intermediaries. a. This origin is not used by the preceding intermediary, because all of these origins carry the IP addresses of the first intermediary in the chain.
Each origin speaks only for itself.
Each entity draws SVCB config information only from the immediately preceding entity in the chain.

Suppose we have a complicated case: tcp.load.balancer.example -> ech.terminator.example -> http.gateway.example -> origin.example. This would work as follows:

The TCP load balancer would indicate that it does not support HTTP/3. https://tcp.load.balancer.example/.well-known/origin-svcb:

{
  "endpoints": [{
    "params": {
      "alpn": ["h2"]
    }
  }]
}

The ECH Terminator supports HTTP/3, but it would inspect the above, see that only HTTP/1.1 and HTTP/2 are supported, and remove any mention of HTTP/3.

https://ech.terminator.example/.well-known/origin-svcb:

{
  "endpoints": [{
    "regeninterval": "1000",
    "params": {
      "alpn": ["h2"],
      "ech": "..."
    }
  }]
}

The HTTP gateway would inspect the above and add any relevant parameters that are true across this gateway configuration. It would respect the regeninterval by periodically fetching the above JSON and regenerating its own JSON.

https://http.gateway.example/.well-known/origin-svcb:

{
  "endpoints": [{
    "regeninterval": "1000",
    "params": {
      "alpn": ["h2"],
      "ech": "...",
      "ohttp": ""
    }
  }]
}

Finally, the origin would do the same with the Gateway's JSON, adding any information it knows can safely be added: https://origin.example/.well-known/origin-svcb:

{
  "endpoints": [{
    "regeninterval": "1000",
    "params": {
      "alpn": ["h2"],
      "ech": "...",
      "ohttp": "",
      "dohpath": "/{?dns}"
    }
  }]
}

The zone factory would be configured with the name "origin.example" and A/AAAA records for that name that correspond to the TCP load balancer. It would use those IPs to request this last JSON file and convert it into a DNS record:

origin.example. IN 1000 1 . alpn=h2 ech=... ohttp dohpath=/{?dns}

Upsides:

Conceptually relatively simple.
Covers all our use cases.

Downsides:

Requires everybody behind an ECH terminator to run continuous dynamic updates.
Requires an HTTP origin to participate, which may not be natural for non-HTTP intermediaries.
A backend that wants to use WKECH to populate HTTPS records that are used by the gateway probably needs to have two different origins and use host rewriting.

Questions:

Can participants drop SvcParams that they know are irrelevant? What about unrecognized SvcParamKeys?
How does "mandatory" work in this world?
How does configuration fusion work for multiple "parallel" preceding intermediaries (e.g. multi-CDN)?

sftcd commented 5 months ago

That seems.... complicated;-)

I dunno if it's safe to assume that information about what to put in HTTPS RRs always flows from outside to inside like that. And thinking of e.g. haproxy as the ECH terminator it could support something (e.g. h3+ECH perhaps when someone does that) but would only enable that for some backends, so not sure that https://ech.terminator.example/.well-known/origin-svcb would work.

Part of me wants to try just do the minimum that'd work for some simpler cases that might be used by small hosters so that they can get into the ECH game, but to do that in a way that could be extended later to something more generic like the above, e.g. in a -bis RFC. (That also touches on #14 too of course.)

bemasc commented 5 months ago

thinking of e.g. haproxy as the ECH terminator it could support something (e.g. h3+ECH perhaps when someone does that) but would only enable that for some backends, so not sure that https://ech.terminator.example/.well-known/origin-svcb would work.

I'm not sure what you mean. The ECH terminator would expose all its capabilities on its own origin, and the the backends would subset those capabilities when publishing the origin-svcb for themselves. If the ECH terminator changes which capabilities are allowed for different origins, it would need separate origins to represent those separate capabilities.

Part of me wants to try just do the minimum

I think we're in agreement here. The point of this architecture is that the ZF only speaks to one source of truth, and doesn't merge configurations from disparate parties. Normatively, this whole architecture can probably be reduced to one sentence: "If the origin makes use of intermediaries, it is the origin's responsibility to ensure that the origin-svcb JSON document correctly accounts for their current configuration.".

sftcd commented 5 months ago

I'm not sure what you mean. The ECH terminator would expose all its capabilities on its own origin

A backend that does support h3 wouldn't know (for sure) if the ECH terminator will/won't proxy h3 for it. With haproxy that'd (IIUC) be down to the specifics of the haproxy config, and haproxy has a v. rich config language.

Or say the ECH terminator has 2 IPv4 addrs and one supports h3 while the other doesn't (for some UDP blocking reason)?

I'm not sure the "I get to do an automatically detectable proper subset of what the upstream guy can do" thing applies in general. For split-mode ECH though, such a setup would work for ECHConfigs.

bemasc commented 5 months ago

If the HAPROXY origin-svcb says it supports H3, then it supports client->proxy H3. (proxy->backend is a separate issue.) If it has different configurations for different backends, then it would need to separate those configurations into distinct origins.

Similarly, two IP addresses configured differently would need to be represented by separate HTTPS records, and hence separate entries in the "endpoints" array.

I think the subset logic works, but we don't have to specify it here. It's sufficient to be clear that as far as the ZF is concerned, only the origin's observable origin-svcb document matters, and how that's generated is not currently specified.

sftcd commented 5 months ago

If the HAPROXY origin-svcb says it supports H3, then it supports client->proxy H3. (proxy->backend is a separate issue.) If it has different configurations for different backends, then it would need to separate those configurations into distinct origins.

Similarly, two IP addresses configured differently would need to be represented by separate HTTPS records, and hence separate entries in the "endpoints" array.

I'm not sure of the above TBH, yes the upstream entity could create different origins (i.e. names for which it has webPKI certs) for those different things, but it seems unlikely to me.

I think the subset logic works, but we don't have to specify it here. It's sufficient to be clear that as far as the ZF is concerned, only the origin's observable origin-svcb document matters, and how that's generated is not currently specified.

I do however agree with the above, luckily:-)

I think we can also safely say that it'd be ok for an ECH terminator to publish an origin-svcb for the public_name that includes the latest ECHConfig and for backends "behind" that to automatically poll-for and use that ECHConfig in their own origin-svcb JSON. Whether such backends can automatically make use of other bits of the ECH terminator's origin-svcb JSON is less clear, and for future study.

sftcd / wkesni

Architecture for Intermediaries #21