Open codope opened 1 year ago
trino-hudi module adds hudi-common
, hudi-hadoop-mr
, hudi-client-common
individually. Instead, we should consider replacing the three dependencies with the hudi-trino-bundle
.
Current workaround is to add the hudi-trino-bundle
in plugin path (<trino_install_dir>/plugin/hudi
).
Describe the problem you faced Original issue: https://github.com/trinodb/trino/issues/15368
The issue was resolved by placing some dependencies in the classpath. Interestingly, those dependencies are already included in the trino-hudi-bundle. This particular issues tracks any gap in packaging.
To Reproduce
Steps to reproduce the behavior:
hudi.metadata-enabled=true
.Trino Hudi Connector Properties:
Hudi Properties set while writing:
General information of table: Total rows = 1,213,959,199 Total Partitions = 2400+ Total file objects = 120,000 Total Size on S3 = 12~13 GB The table was upgraded from 0.9.0 to 0.10.1
Coordinator Relevant Logs:
Expected behavior
They query should work out-of-the-box without having to place jars in classpath.
Environment Description
Hudi version : 0.10.1
Spark version : 2.4
Trino version : 400
Hadoop version :
Storage (HDFS/S3/GCS..) :
Running on Docker? (yes/no) : no
Additional context
Add any other context about the problem here.
Stacktrace
Full stacktrace in Partitioned_COW_Hudi_Coordinator_logs.log