Open userhimanshuverma opened 1 month ago
Hi @olaky, could you please review this pull request and let me know if any changes are required?
@marmbrus @grundprinzip @rtyler Could you please review this PR?
Hi @olaky, thank you for your feedback on my PR adding Hive 4 support to Delta Lake. I understand your concern that the current changes appear to re-declare Hive 3 as Hive 4 rather than properly adding support for Hive 4. I appreciate your insights, and I'd like to clarify the intent and the proposed solution.
Current Changes:
Version Management:
Proposed Solution:
Next Steps:
Thank you for your guidance, and I'm looking forward to your thoughts on this proposed solution.
Hi, I did some more browsing, and at least according to the documentation, Spark does not support Hive version 4 yet. Did you validate that this really works?
Hi @olaky,
Thank you for your follow-up. I want to clarify that this PR is focused on enabling the creation of Delta tables directly through the Hive shell, rather than through the Spark shell.
Regarding Spark, I have tested version 3.5 with Hive 4.0.0 and validated that CRUD operations (create, read, update, delete) work successfully when Delta jars are added to the Spark library. However, the goal here is to provide support for creating Delta tables in the Hive shell itself, independent of Spark.
Currently, in Hive 3, it is possible to create Delta tables by building the Delta Uber jar. However, I encountered issues with Hive 4, which required adjustments to enable table creation through the Hive shell. The changes in this PR are meant to address these issues and provide Hive 4 compatibility for Delta tables.
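For reference, here is a rough sketch of the kind of statement involved when a Delta table is registered from the Hive shell. It only builds the DDL text and does not talk to Hive. The helper name `hive_delta_ddl`, the table name, the columns, and the path are hypothetical placeholders; the storage handler class `io.delta.hive.DeltaStorageHandler` is the one named in the Delta Hive connector documentation. Whether this statement runs unchanged on Hive 4 is the open question in this PR, so treat it as an illustration, not a verified recipe.

```python
from typing import Dict


def hive_delta_ddl(table: str, columns: Dict[str, str], location: str) -> str:
    """Build a Hive CREATE EXTERNAL TABLE statement that uses the Delta storage handler.

    Hypothetical helper for illustration only; it performs no escaping or validation.
    """
    # Render "name TYPE" pairs in insertion order, e.g. "id INT, name STRING".
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns.items())
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols})\n"
        "STORED BY 'io.delta.hive.DeltaStorageHandler'\n"
        f"LOCATION '{location}'"
    )


# Placeholder table name, schema, and path; substitute real values before use.
ddl = hive_delta_ddl("events", {"id": "INT", "name": "STRING"}, "/data/delta/events")
print(ddl)
```

The printed statement would be pasted into the Hive shell after the Delta uber jar has been added to Hive's classpath.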
I hope this clarification helps, and I look forward to your feedback on the proposed changes.
Sure, I will make the changes so that it won't affect.
Which Delta project/connector is this regarding?
Description
This pull request adds Delta Lake compatibility with Apache Hive 4.0, enabling Hive 4.0 users to create Delta tables.
Key Changes:
Why We Need This Change:
Integrating Delta Lake with Hive 4.0 bridges the gap between Hive users and the powerful features of Delta Lake. This change simplifies data workflows, allowing users to leverage Delta Lake’s robust features without leaving the Hive environment.
Resolved Issues:
This PR resolves any outstanding issues related to the lack of Delta Lake compatibility in Hive 4.0.
How was this patch tested?
Does this PR introduce any user-facing changes?
Yes, this PR introduces user-facing changes:
These changes enhance the existing capabilities of Hive, making it more versatile in managing large datasets with Delta Lake's advanced features.