Join the mailing list to engage in conversations and get faster support at dev-subscribe@hudi.apache.org.
If you have triaged this as a bug, then file an issue directly.
Describe the problem you faced
I am getting duplicated records when using insert overwrite. There are multiple commit times in the hoodie table and duplicated records exist after using insert overwrite into the target table. The query involves joining 10 tables.
Environment Description
Hudi version : 0.9
Spark version : 3.0.1
Hive version : 3.2
Hadoop version : 3.2
Storage (HDFS/S3/GCS..) : s3
Running on Docker? (yes/no) : no
Expected behavior
A clear and concise description of what you expected to happen.
Additional context
Add any other context about the problem here.
Tips before filing an issue
Describe the problem you faced I am getting duplicated records when using insert overwrite. There are multiple commit times in the hoodie table and duplicated records exist after using insert overwrite into the target table. The query involves joining 10 tables.
Environment Description
Expected behavior A clear and concise description of what you expected to happen.
Additional context Add any other context about the problem here.
Stacktrace Add the stacktrace of the error.