apache / beam

Apache Beam is a unified programming model for Batch and Streaming data processing.
https://beam.apache.org/
Apache License 2.0

[Feature Request]: Add Java API for using and upgrading Iceberg via the Managed transforms API #30892

Closed. chamikaramj closed this issue 6 months ago

chamikaramj commented 7 months ago

What would you like to happen?

We recently added an API for Managed transforms: https://github.com/apache/beam/issues/30830

We are also working on adding an Iceberg source and sink.

We need to connect Iceberg to the Managed transforms API and add support for upgrading it.

Upgrading can be performed via the Transform Service [1] by adding schema-aware transforms for the Iceberg source and sink and implementing a PTransformTranslation.TransformPayloadTranslator [2] for each.

[1] https://beam.apache.org/documentation/programming-guide/#transform-service-usage-upgrade
[2] https://github.com/apache/beam/blob/58e4f8325fc0a2f7cd71bf9ccefea06a5ac6f75b/sdks/java/core/src/main/java/org/apache/beam/sdk/util/construction/PTransformTranslation.java#L597
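
To make the upgrade idea concrete, here is a rough, self-contained sketch of the round trip a payload translator enables: the transform's user-facing configuration is written out as a portable payload, and the transform is later rebuilt from that payload by a newer implementation. Everything in it is hypothetical and is not Beam's API: `IcebergWriteSketch`, `PayloadTranslatorSketch`, the URN string, and the config keys `table` and `catalog_name` are illustrative assumptions, and a plain `Map` stands in for a schema-aware Row.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;

// Hypothetical sketch only: none of these types are Beam classes.
public class IcebergUpgradeSketch {

  /** Stand-in for a schema-aware Iceberg write transform (assumed shape). */
  static final class IcebergWriteSketch {
    final String table;
    final String catalogName;
    final String implVersion; // version of the code that built this instance

    IcebergWriteSketch(String table, String catalogName, String implVersion) {
      this.table = Objects.requireNonNull(table, "table");
      this.catalogName = Objects.requireNonNull(catalogName, "catalogName");
      this.implVersion = Objects.requireNonNull(implVersion, "implVersion");
    }
  }

  /** Hypothetical analogue of a payload translator: transform <-> portable config. */
  interface PayloadTranslatorSketch<T> {
    String urn();

    Map<String, Object> toConfig(T transform);

    T fromConfig(Map<String, Object> config, String implVersion);
  }

  static final class IcebergWriteTranslator
      implements PayloadTranslatorSketch<IcebergWriteSketch> {

    @Override
    public String urn() {
      return "example:sketch:iceberg_write:v1"; // made-up identifier
    }

    @Override
    public Map<String, Object> toConfig(IcebergWriteSketch transform) {
      // Only user-facing configuration goes into the payload, never the
      // implementation version, so a newer implementation can consume it.
      Map<String, Object> config = new LinkedHashMap<>();
      config.put("table", transform.table);
      config.put("catalog_name", transform.catalogName);
      return config;
    }

    @Override
    public IcebergWriteSketch fromConfig(Map<String, Object> config, String implVersion) {
      return new IcebergWriteSketch(
          (String) config.get("table"), (String) config.get("catalog_name"), implVersion);
    }
  }

  /** Rebuilds a transform from its own payload using a newer implementation version. */
  static IcebergWriteSketch upgrade(IcebergWriteSketch original, String newVersion) {
    IcebergWriteTranslator translator = new IcebergWriteTranslator();
    Map<String, Object> payload = translator.toConfig(original);
    return translator.fromConfig(payload, newVersion);
  }

  public static void main(String[] args) {
    IcebergWriteSketch original = new IcebergWriteSketch("db.events", "demo_catalog", "1.0");
    IcebergWriteSketch upgraded = upgrade(original, "2.0");
    System.out.println(upgraded.table + " @ " + upgraded.implVersion);
  }
}
```

The point of the sketch is the separation: because the payload carries only configuration, whatever service performs the upgrade can swap the implementation without the pipeline author changing any code. The real translator and Transform Service involve URNs, schemas, and proto payloads that this example does not attempt to model.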

Issue Priority

Priority: 2 (default / most feature requests should be filed as P2)

Issue Components

chamikaramj commented 6 months ago

Fixed by https://github.com/apache/beam/pull/30910