camunda / camunda-docs

Camunda 8 Documentation, including all components and features
https://docs.camunda.io/

Document recommended configuration for worker job streaming with reverse proxies #3903

Closed npepinpe closed 2 months ago

npepinpe commented 3 months ago

Description

We have now had two incidents caused by long-lived job streams being closed unexpectedly with 504 errors. This is most prominent with nginx when users configure the grpc_read_timeout parameter, because idle workers may receive no "response" for a long time. The nginx project appears to have no plans to forward HTTP/2 pings, or to treat them as keeping the connection alive.

So the recommendation for now is the following:

While this is nginx-specific, it may also be useful for other reverse proxies (e.g. Traefik), which may have similar issues.

My only question is where this kind of documentation should live: in the self-managed operation guides (but where?), or with the job worker documentation. Please consult someone from the DevEx team for their opinion during kickoff.

Context

npepinpe commented 3 months ago

If https://github.com/camunda/camunda/issues/19188 is accepted by the ZPA team, please adjust the docs accordingly to reference the new default value as well, and suggest a grpc_read_timeout (or the equivalent in other ingresses) of twice that value.
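
To illustrate the "twice that" relationship, here is a rough sketch of what the nginx side might look like. The numbers are assumptions, not the actual defaults: it supposes, hypothetically, that the new default ends up at 1 hour, which would make the suggested grpc_read_timeout 2 hours. The upstream hostname is a placeholder as well; 26500 is the usual gateway gRPC port.

```nginx
# Sketch only: values are illustrative assumptions, not recommended defaults.
location / {
    # Hypothetical upstream name; point this at your own Zeebe gateway.
    grpc_pass grpc://zeebe-gateway:26500;

    # Assuming a 1h default on the client side, use twice that here so an
    # idle job stream is not closed by the proxy first (nginx default is 60s).
    grpc_read_timeout 2h;
}
```

Other ingresses would need their equivalent read/idle timeout set the same way.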

akeller commented 3 months ago

My only question is where this kind of documentation should live: in the self-managed operation guides (but where?), or with the job worker documentation. Please consult someone from the DevEx team for their opinion during kickoff.

🧡 Thank you! When ready, please connect with @conceptualshark, as our emerging DRI of Self-Managed docs.

npepinpe commented 3 months ago

@conceptualshark - I'm working off a draft right now to put this in self-managed as a new page under zeebe-deployment/zeebe-gateway. I would also link the respective client pages to this page of course.

I'm not sure if this is the best place though. We don't have a page about ingress config in general or reverse proxy usage either, where it could potentially also go.

akeller commented 2 months ago

@conceptualshark - I'm working off a draft right now to put this in self-managed as a new page under zeebe-deployment/zeebe-gateway. I would also link the respective client pages to this page of course.

I'm not sure if this is the best place though. We don't have a page about ingress config in general or reverse proxy usage either, where it could potentially also go.

👆 @conceptualshark ICYMI