numberlabs-developers / hudi

Upserts, Deletes And Incremental Processing on Big Data.
https://hudi.apache.org/
Apache License 2.0

[SUPPORT] Support for multiple file formats in the same SparkSession #235

Open torvalds-dev-testbot[bot] opened 3 months ago

torvalds-dev-testbot[bot] commented 3 months ago

Describe the problem you faced: Is it possible to use both the Hudi and Delta Lake table formats in the same SparkSession on Spark 3.2 and above?


torvalds-dev-testbot[bot] commented 3 months ago

Relevant Sources:

https://hudi.apache.org/docs/configurations

https://hudi.apache.org/docs/quick-start-guide

https://blog.datumagic.com/p/apache-hudi-from-zero-to-one-910?r=2fl10k&utm_campaign=post&utm_medium=web

https://apache-hudi.slack.com/archives/C4D716NPQ/p1713511341371689

torvalds-dev-testbot[bot] commented 3 months ago

Not satisfied with the answer

torvalds-dev-testbot[bot] commented 3 months ago

Hello! 👋 Please wait while I search the knowledge base for an answer; this usually takes about 30 seconds

torvalds-dev-testbot[bot] commented 3 months ago

Based on the provided documentation, to support both Hudi and Delta Lake in the same SparkSession, you need to set the spark_catalog configuration parameter accordingly.
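
For illustration only, and not part of the bot's reply: a minimal sketch of the session settings this kind of setup usually involves. The extension and catalog class names are the ones given in the Hudi and Delta Lake quick-start guides. The helper names build_session_conf and to_spark_submit_args are hypothetical. spark.sql.extensions takes a comma-separated list, so both extensions can be listed, but spark.sql.catalog.spark_catalog holds a single class, so only one of the two formats can own the session catalog. Whether the two coexist cleanly on a given Spark, Hudi, and Delta version combination is an assumption that has to be tested.

```python
# Class names as published in the Hudi and Delta Lake quick-start guides.
HUDI_EXTENSION = "org.apache.spark.sql.hudi.HoodieSparkSessionExtension"
DELTA_EXTENSION = "io.delta.sql.DeltaSparkSessionExtension"
HUDI_CATALOG = "org.apache.spark.sql.hudi.catalog.HoodieCatalog"
DELTA_CATALOG = "org.apache.spark.sql.delta.catalog.DeltaCatalog"


def build_session_conf(catalog_owner="hudi"):
    """Hypothetical helper: return Spark settings that register both extensions.

    catalog_owner picks which format's catalog class is assigned to
    spark.sql.catalog.spark_catalog, since that key accepts only one class.
    """
    catalogs = {"hudi": HUDI_CATALOG, "delta": DELTA_CATALOG}
    if catalog_owner not in catalogs:
        raise ValueError("catalog_owner must be 'hudi' or 'delta'")
    return {
        # Kryo serialization is what the Hudi quick-start guide configures.
        "spark.serializer": "org.apache.spark.serializer.KryoSerializer",
        # spark.sql.extensions accepts a comma-separated list of classes.
        "spark.sql.extensions": ",".join([HUDI_EXTENSION, DELTA_EXTENSION]),
        # Only one class can be the session catalog (assumption: the caller
        # accepts that the other format is then used without its catalog).
        "spark.sql.catalog.spark_catalog": catalogs[catalog_owner],
    }


def to_spark_submit_args(conf):
    """Hypothetical helper: render the settings as --conf arguments."""
    args = []
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args
```

In PySpark, each pair can be passed to SparkSession.builder.config(key, value) before calling getOrCreate(). Path-based DataFrame access through spark.read.format("hudi") or spark.read.format("delta") does not go through the session catalog, so it may be the simpler route for the format that does not own spark_catalog.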

torvalds-dev-testbot[bot] commented 3 months ago

Abhishek posted:

https://devs-numberlabs.slack.com/archives/C06PDCQPA3W/p1722424798679249?thread_ts=1722425335319449&cid=C06PDCQPA3W

torvalds-dev-testbot[bot] commented 3 months ago

Ticket created successfully. Here is the link to the GitHub issue: https://github.com/numberlabs-developers/hudi/issues/235
