apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.
https://hudi.apache.org/
Apache License 2.0
5.32k stars 2.41k forks source link

unable to connect hudi from hive CLI #11476

Closed Pavan792reddy closed 3 weeks ago

Pavan792reddy commented 2 months ago

Tips before filing an issue i have used hudi deltastreamer to load the data into hudi and read the data from hive CLI , we are facing the below error. when i am trying to read JDBC of beeline i can able to read the data .

Describe the problem you faced

A clear and concise description of the problem.

To Reproduce

Steps to reproduce the behavior:

1. 2. 3. 4.

Expected behavior

A clear and concise description of what you expected to happen.

Environment Description

Additional context

Add any other context about the problem here.

Stacktrace

Add the stacktrace of the error.

ERROR :-

hive> select * from avroschema_20; FAILED: RuntimeException java.lang.ClassNotFoundException: org.apache.hudi.hadoop.HoodieParquetInputFormat hive>

ad1happy2go commented 2 months ago

@Pavan792reddy have you added hudi-mr-bundle to the hive lib?

Pavan792reddy commented 2 months ago

i mean in hive-site.xml file ?, i have added those details into the xml file and the xml file also copied into the /etc/spark/conf/ directory as well.

ad1happy2go commented 2 months ago

No add jar in hive lib. https://hudi.apache.org/docs/syncing_metastore/#hive-environment

ad1happy2go commented 3 weeks ago

@Pavan792reddy Confirmed with @Pavan792reddy . It was fixed. Thanks.