apache / beam

Apache Beam is a unified programming model for Batch and Streaming data processing.
https://beam.apache.org/
Apache License 2.0
7.8k stars 4.22k forks source link

[Feature Request]: Add soft-deadline before 1 minute hard-deadline to Spanner Change Stream IO connector #32473

Open dedocibula opened 1 week ago

dedocibula commented 1 week ago

Currently, Spanner Change Stream IO connector expects the beam framework to issue a split event either 5s or every 5MB of processed payload. In case that such event doesn't occur processing will continue until ultimately 1m pre-set timeout for Change Stream query is reached. This will cause the existing work to be dropped and query to be rescheduled. This feature tracks work for adding a soft deadline before the hard deadline that will allow to commit already processed work and updates internal processing low watermark

Issue Priority

Priority: 2 (default / most feature requests should be filed as P2)

Issue Components

dedocibula commented 1 week ago

.take-issue