Open torvalds-dev-testbot[bot] opened 4 months ago
Hello! 👋 Please wait while I search the knowledge base for an answer; this usually takes about 30 seconds
To install Hudi, you can follow different methods based on your use case and environment:
For local setup after building Hudi, you can use the hudi-cli by navigating to the hudi-cli directory and running ./hudi-cli.sh.
If you are using Spark 3, you can launch the Hudi CLI using the hudi-cli-bundle. Ensure you have spark installed locally with the required dependencies and set the necessary environment variables.
On AWS EMR, Hudi comes pre-installed. You just need to pass the required jars and configurations in the spark-submit command.
To integrate Hudi with Spark, you can start using it with Spark shell by running specific commands depending on your Spark version. Hudi works with Spark 2.4.3+ and Spark 3.x versions.
For AWS Glue, you can enable Hudi functionality by setting specific parameters in your AWS Glue job and configurations.
Sources:
Ticket created successfully. Here is the link to the GitHub issue: https://github.com/numberlabs-developers/hudi/issues/220
Ticket created successfully. Here is the link to the GitHub issue: https://github.com/numberlabs-developers/hudi/issues/220
Tips before filing an issue
Describe the problem you faced A clear and concise description of the problem.
To Reproduce Steps to reproduce the behavior: 1. 2. 3. 4.
Expected behavior A clear and concise description of what you expected to happen.
Environment Description
Additional context Add any other context about the problem here.
Stacktrace Add the stacktrace of the error.