GoogleCloudPlatform / DataflowTemplates

Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
https://cloud.google.com/dataflow/docs/guides/templates/provided-templates
Apache License 2.0

Feature request for Datastream to BigQuery (Stream) template to support PostgreSQL as source and PostgreSQL merges #1765

Open AnandInguva opened 3 months ago

AnandInguva commented 3 months ago

Related Template(s)

Datastream to BigQuery (Stream) template

What feature(s) are you requesting?

A customer attempted to use the Datastream to BigQuery (Stream) template with PostgreSQL as the source database, intending to use the template's merge functionality in BigQuery. However, the merge step in BigQuery failed with a metadata error: _metadata_lsn is unrecognized.

Enhance the Datastream to BigQuery (Stream) template to officially support PostgreSQL merges. This would align the template with customer expectations and provide a streamlined solution for replicating and merging PostgreSQL data into BigQuery.

Also, update the public documentation for the Datastream to BigQuery (Stream) template to clarify its capabilities and limitations, especially concerning PostgreSQL.
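To make the failure mode concrete, here is a minimal Python sketch of how a CDC merge query might pick its ordering columns per source type. It is not the template's actual code. The function name `build_merge_sql`, the `ORDER_BY_COLUMNS` mapping, and every column name other than `_metadata_lsn` (the one named in the error above) are assumptions for illustration. If the staging table lacks a column named in the ORDER BY clause, BigQuery would reject the query with an unrecognized-name error, which is consistent with what the customer saw.

```python
from typing import Dict, List

# Hypothetical mapping from source type to the metadata columns that order changes.
# Only _metadata_lsn comes from the issue text; the other names are assumptions.
ORDER_BY_COLUMNS: Dict[str, List[str]] = {
    "mysql": ["_metadata_timestamp", "_metadata_log_file", "_metadata_log_position"],
    "postgresql": ["_metadata_timestamp", "_metadata_lsn"],
}


def build_merge_sql(
    target: str,
    staging: str,
    primary_keys: List[str],
    columns: List[str],
    source_type: str,
) -> str:
    """Build a BigQuery MERGE that applies the latest change per primary key."""
    if source_type not in ORDER_BY_COLUMNS:
        raise ValueError(f"no ordering columns configured for source type {source_type!r}")

    partition = ", ".join(primary_keys)
    # Newest change first, so row number 1 is the latest state of each key.
    order_by = ", ".join(f"{c} DESC" for c in ORDER_BY_COLUMNS[source_type])
    on_clause = " AND ".join(f"t.{k} = s.{k}" for k in primary_keys)
    non_key = [c for c in columns if c not in primary_keys]
    update_set = ", ".join(f"{c} = s.{c}" for c in non_key)
    insert_cols = ", ".join(columns)
    insert_vals = ", ".join(f"s.{c}" for c in columns)

    # _metadata_deleted is an assumed boolean column marking delete events.
    return (
        f"MERGE `{target}` AS t\n"
        f"USING (\n"
        f"  SELECT * EXCEPT (_rn) FROM (\n"
        f"    SELECT *, ROW_NUMBER() OVER (PARTITION BY {partition} ORDER BY {order_by}) AS _rn\n"
        f"    FROM `{staging}`\n"
        f"  ) WHERE _rn = 1\n"
        f") AS s\n"
        f"ON {on_clause}\n"
        f"WHEN MATCHED AND s._metadata_deleted THEN DELETE\n"
        f"WHEN MATCHED THEN UPDATE SET {update_set}\n"
        f"WHEN NOT MATCHED AND NOT s._metadata_deleted THEN "
        f"INSERT ({insert_cols}) VALUES ({insert_vals})"
    )


if __name__ == "__main__":
    # Table names below are placeholders.
    print(build_merge_sql(
        "project.dataset.users", "project.dataset.users_log",
        ["id"], ["id", "name"], "postgresql",
    ))
```

In this sketch, supporting a new source amounts to adding an entry to the mapping and making sure the staging table actually carries those columns; an unlisted source type fails fast with a ValueError instead of surfacing later as a BigQuery error.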

SargisPlusPlus commented 2 days ago

Related: https://github.com/GoogleCloudPlatform/DataflowTemplates/pull/610