Sheth-G / test-repo-app

MIT License

[SUPPORT] How to Install Hudi? #38

Open Sheth-G opened 2 weeks ago

Sheth-G commented 2 weeks ago

Describe the problem you faced: How to install Hudi?


torvalds-dev-testbot[bot] commented 2 weeks ago

Sharvil posted: What is hudi?
> Creator: Sharvil
> Source Channel: triage-test-2
> Status: Unresolved
> Assignee: Unassigned
> Created: 6/11/2024 10:59 PM
> Last Updated: 6/11/2024 10:59 PM

torvalds-dev-testbot[bot] commented 2 weeks ago

Relevant Sources:

https://apache-hudi.slack.com/archives/C4D716NPQ/p1711133001694489

https://www.linkedin.com/pulse/small-talk-apache-hudi-ashok-kumar-kunkala-3ldge/

https://medium.com/@karim.faiz/what-is-apache-hudi-e9363083830e

https://datacouch.medium.com/getting-started-with-apache-hudi-711b89c107aa

https://apache-hudi.slack.com/archives/C4D716NPQ/p1712128301197079

torvalds-dev-testbot[bot] commented 2 weeks ago

Hello! :wave: Please wait while I search the knowledge base for an answer; this usually takes about 30 seconds

torvalds-dev-testbot[bot] commented 2 weeks ago

Follow up questions:

  1. Could you describe the version of Hudi and other tech stack in your environment?
  2. Have you configured the metastore uris for syncing Hudi tables to Hive?
  3. What's your indexing scheme?
  4. What table type are you using?
  5. Can you share your Hudi timeline?


torvalds-dev-testbot[bot] commented 2 weeks ago

Apache Hudi is an open-source data management framework for data lakes built on Hadoop-compatible storage. The name stands for Hadoop Upserts Deletes and Incrementals, and it is designed for efficient data management in big data environments. Key features include efficient updates and deletes in data lakes, transactional writes with ACID guarantees, storing updates as delta log files alongside columnar base files (in Merge-on-Read tables), schema evolution, and record-level indexing. It is used by companies such as Uber, Alibaba, and Verizon Media to manage big data efficiently and reliably.

Source: https://medium.com/plumbersofdatascience/spark-etl-chapter-8-with-lakehouse-apache-hudi-d4794b8a79e6
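
On the original question of how to install Hudi: with Spark there is usually nothing to install separately, since the Hudi bundle jar can be pulled in when the Spark shell starts. The sketch below only assembles the launch command and a set of write options. The versions (Hudi 0.14.1, Spark 3.4, Scala 2.12) and the helper function names are assumptions for illustration, so adjust them to match your environment.

```python
# Assumed versions for illustration; pick the bundle that matches your Spark/Scala build.
HUDI_VERSION = "0.14.1"
SPARK_MINOR = "3.4"
SCALA_VERSION = "2.12"


def hudi_bundle_coordinate(hudi=HUDI_VERSION, spark=SPARK_MINOR, scala=SCALA_VERSION):
    """Maven coordinate of the Hudi Spark bundle, for use with --packages."""
    return f"org.apache.hudi:hudi-spark{spark}-bundle_{scala}:{hudi}"


def hudi_spark_conf():
    """Spark settings commonly used when running Hudi on Spark 3."""
    return {
        "spark.serializer": "org.apache.spark.serializer.KryoSerializer",
        "spark.sql.extensions": "org.apache.spark.sql.hudi.HoodieSparkSessionExtension",
        "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.hudi.catalog.HoodieCatalog",
    }


def hudi_upsert_options(table_name, record_key, precombine_field, partition_field):
    """Write options for an upsert into a Copy-on-Write table (hypothetical helper)."""
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.partitionpath.field": partition_field,
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.table.type": "COPY_ON_WRITE",
    }


def pyspark_command():
    """Build the shell command that starts pyspark with the Hudi bundle loaded."""
    parts = ["pyspark", "--packages", hudi_bundle_coordinate()]
    for key, value in hudi_spark_conf().items():
        parts += ["--conf", f"{key}={value}"]
    return " ".join(parts)


if __name__ == "__main__":
    print(pyspark_command())
```

Inside the resulting shell, a DataFrame can then be written with `df.write.format("hudi").options(**opts).mode("append").save(path)`, where `opts` comes from `hudi_upsert_options` and `path` is your table location. The table type set here is also the answer to follow-up question 4 above.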

torvalds-dev-testbot[bot] commented 2 weeks ago

You could try using http://xyz.com

Sheth-G commented 2 weeks ago

You could try using phy.com