apache / beam

Apache Beam is a unified programming model for Batch and Streaming data processing.
https://beam.apache.org/
Apache License 2.0
7.86k stars 4.25k forks source link

Message limit for grpc message is set to default 4 MB #20455

Open damccorm opened 2 years ago

damccorm commented 2 years ago

The default limit of a grpc message is set low to 4 MB, and is not overridden in the sdk_worker.py for the _control_channel. This was first discovered when submitting a beam job via a flink operator orchestrated by kubernetes (https://github.com/GoogleCloudPlatform/flink-on-k8s-operator). This issue was resolved for apache-beam\==2.18 in BEAM-10334, but exists in apache-beam\==2.22 (and presumably all versions 2.17 onwards). 

 

This can be fixed by modifying the options parameter to allow any size grpc message to be sent or received.

Imported from Jira BEAM-10458. Original Jira may contain additional context. Reported by: kshivvy.

kshivvy commented 2 years ago

Here is the PR that resolved this issue for apache-beam==2.18, but not for the other versions.

CC: @charlesccychen