numberlabs-developers / hudi

Upserts, Deletes And Incremental Processing on Big Data.
https://hudi.apache.org/
Apache License 2.0

[SUPPORT] Query about using Hudi in Pyspark for BigQuery sync #225

Open torvalds-dev-testbot[bot] opened 3 months ago

torvalds-dev-testbot[bot] commented 3 months ago


Describe the problem you faced

The user is asking whether Hudi's BigQuery sync can be used from PySpark. They could not find any resources on how to implement it.


torvalds-dev-testbot[bot] commented 3 months ago

Relevant Sources:

https://dzone.com/articles/delta-hudi-and-iceberg-the-data-lakehouse-trifecta

https://hudi.apache.org/docs/sql_queries

https://hudi.apache.org/docs/gcp_bigquery

https://hudi.apache.org/docs/syncing_metastore

https://github.com/apache/hudi/issues/7117

torvalds-dev-testbot[bot] commented 3 months ago

Ticket created successfully. Here is the link to the GitHub issue: https://github.com/numberlabs-developers/hudi/issues/225


torvalds-dev-testbot[bot] commented 3 months ago

Hello! :wave: Please wait while I generate the follow-up questions; this usually takes about 30 seconds.

torvalds-dev-testbot[bot] commented 3 months ago

Follow-up questions:

  1. Which version of Hudi are you running, and what does the rest of your stack look like (Spark version, storage, and so on)?
  2. Which Hudi write configurations are you using for BigQuery sync in your PySpark code?
  3. Could you share the PySpark code snippet you are currently using to write the Hudi table and sync it to BigQuery?
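As a starting point for question 3, here is a minimal sketch of the write options a PySpark job might pass to Hudi to trigger BigQuery sync. It is not a confirmed recipe: the helper `hudi_bigquery_sync_options` and all of its argument values are hypothetical, and the configuration keys are taken from my reading of the Hudi BigQuery docs linked above (https://hudi.apache.org/docs/gcp_bigquery), so please check them against the Hudi version you run. It also assumes a copy-on-write table on GCS and that the Hudi Spark and GCP bundles are on the Spark classpath.

```python
def hudi_bigquery_sync_options(table_name, base_path, project_id, dataset_name,
                               dataset_location, record_key, precombine_field,
                               partition_field):
    """Build a dict of Hudi write options with BigQuery sync enabled.

    Hypothetical helper for illustration; the option keys are assumed from
    the Hudi BigQuery sync documentation and may differ between releases.
    """
    base = base_path.rstrip("/")  # normalise so the URIs below are well formed
    return {
        # Core Hudi write settings
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.partitionpath.field": partition_field,
        "hoodie.datasource.write.operation": "upsert",
        # The BigQuery docs call for hive-style partitions and for dropping
        # partition columns from the data files
        "hoodie.datasource.write.hive_style_partitioning": "true",
        "hoodie.datasource.write.drop.partition.columns": "true",
        # Enable meta sync and point it at the BigQuery sync tool
        "hoodie.datasource.meta.sync.enable": "true",
        "hoodie.meta.sync.client.tool.class":
            "org.apache.hudi.gcp.bigquery.BigQuerySyncTool",
        # BigQuery target and source locations
        "hoodie.gcp.bigquery.sync.project_id": project_id,
        "hoodie.gcp.bigquery.sync.dataset_name": dataset_name,
        "hoodie.gcp.bigquery.sync.dataset_location": dataset_location,
        "hoodie.gcp.bigquery.sync.table_name": table_name,
        "hoodie.gcp.bigquery.sync.base_path": base,
        "hoodie.gcp.bigquery.sync.source_uri": f"{base}/{partition_field}=*",
        "hoodie.gcp.bigquery.sync.source_uri_prefix": f"{base}/",
        "hoodie.gcp.bigquery.sync.partition_fields": partition_field,
    }


# Usage inside a PySpark job (requires a SparkSession and a DataFrame `df`):
#
#   opts = hudi_bigquery_sync_options(
#       table_name="trips", base_path="gs://my-bucket/trips",
#       project_id="my-project", dataset_name="analytics",
#       dataset_location="US", record_key="id",
#       precombine_field="ts", partition_field="dt")
#   df.write.format("hudi").options(**opts).mode("append").save("gs://my-bucket/trips")
```

Keeping the options in a plain dict makes it easy to paste them into this thread when answering question 2, and to compare them against the docs for your Hudi release.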


torvalds-dev-testbot[bot] commented 3 months ago

Please connect your Slack account on torvalds.dev to publish the message on behalf of the user, then try again. You can grant permission at https://torvalds-dev-nl-ghco-nl-ghcos-projects.vercel.app/settings
