Sheth-G / hudi

Upserts, Deletes And Incremental Processing on Big Data.
https://hudi.apache.org/
Apache License 2.0
0 stars 0 forks source link

[SUPPORT] How to install Apache Hudi #10

Open torvalds-dev-testbot[bot] opened 3 weeks ago

torvalds-dev-testbot[bot] commented 3 weeks ago

Tips before filing an issue

Describe the problem you faced

How to install Apache Hudi

To Reproduce

Steps to reproduce the behavior:

1. 2. 3. 4.

Expected behavior

A clear and concise description of what you expected to happen.

Environment Description

Additional context

Add any other context about the problem here.

Stacktrace

Add the stacktrace of the error.

torvalds-dev-testbot[bot] commented 3 weeks ago

Relevant Sources:

https://apache-hudi.slack.com/archives/C4D716NPQ/p1711133001694489

https://github.com/apache/hudi/blob/master/README.md

https://hudi.apache.org/docs/syncing_metastore

https://api.github.com/repos/apache/hudi/issues/8141

https://datacouch.medium.com/getting-started-with-apache-hudi-711b89c107aa >torvalds.dev is learning and improving. React below to provide feedback! 👍 0 button 👎 0 button

torvalds-dev-testbot[bot] commented 3 weeks ago

Hello! 👋 Please wait while I search the knowledge base for an answer; this usually takes about 30 seconds

torvalds-dev-testbot[bot] commented 3 weeks ago

Based on the provided documentation snippets, to install Apache Hudi on your EMR machine, follow these steps:

  1. Verify that EMR version is compatible with the instructions.
  2. Pass the necessary jars and configurations in the spark-submit command.
  3. Use the hudi-spark-bundle.jar and spark-avro.jar that come pre-installed on EMR.
  4. Ensure compatibility between EMR and Hudi versions, as they may change with different EMR versions.
  5. Remember, using Hudi on EMR is straightforward due to pre-installed components.

Sources:

torvalds-dev-testbot[bot] commented 3 weeks ago

[[A concise and precise answer]] To install Hudi on AWS EMR, you can utilize the pre-installed Hudi on EMR by passing the necessary jars and configurations in the spark-submit command. Make sure to adjust the steps based on the EMR version you are using as Hudi versions may vary across different EMR versions.

[[Follow up questions]]

[[Sources]]

torvalds-dev-testbot[bot] commented 3 weeks ago

okay thanks

Sheth-G commented 3 weeks ago

thank you too

torvalds-dev-testbot[bot] commented 3 weeks ago

How to install hudi

torvalds-dev-testbot[bot] commented 3 weeks ago

How to install hudi