apache / beam

Apache Beam is a unified programming model for Batch and Streaming data processing.
https://beam.apache.org/
Apache License 2.0
7.76k stars 4.21k forks source link

[Bug]: [Flink] JdbcIO metrics are always `NaN` when using the FlinkRunner #31544

Open oswidan97 opened 3 months ago

oswidan97 commented 3 months ago

What happened?

Specifically these 2 metrics in JdbcIO that should be updated when inserting into the sink database, always have the value NaN, even though data is being ingested in the sink database that I am using.

I am using the apache beam version 2.56.0 and beam-runners-flink-1.17

Screenshot from the FlinkUI image

Issue Priority

Priority: 2 (default / most bugs should be filed as P2)

Issue Components

je-ik commented 3 months ago

Thanks for the report. A few questions, if you don't mind:

oswidan97 commented 3 months ago

Thanks for the report. A few questions, if you don't mind:

  • is your Pipeline bounded or unbounded?
  • can you please share any other command-line arguments/configuration passed to the job?
  • if bounded, can you verify if the metrics are available after the job terminates via PipelineResult?
je-ik commented 3 months ago

Thanks. Could build patched version of Beam and try to log the contents of the container in here?