apache / beam

Apache Beam is a unified programming model for Batch and Streaming data processing.
https://beam.apache.org/
Apache License 2.0
7.81k stars 4.23k forks source link

Raise Java Http KeepAlive Cache Size #21575

Open damccorm opened 2 years ago

damccorm commented 2 years ago

Java HTTP[S]Connection's keep alive by default keeps 5 idle connections alive. Dataflow user workers could make 1000s of connection to the same endpoint. The HTTPS handshake overhead is significant in pipelines using BigQueryIO streaming inserts and bumping up the keep alive limit reduces it. The keep alive connection count can be changed by setting the JVM system property "http.maxConnections". 

Imported from Jira BEAM-14437. Original Jira may contain additional context. Reported by: arunpandianp.

damccorm commented 2 years ago

Unable to assign user @arunpandianp. If able, self-assign, otherwise tag @damccorm so that he can assign you. Because of GitHub's spam prevention system, your activity is required to enable assignment in this repo.