Open koochiswathiTR opened 1 year ago
Did you enable the lazy cleaning for multi writers?
@danny0405 We are not using multi-writer; it's a single writer only. What is lazy cleaning? Can you briefly explain it?
Which version of Hudi did you use? This looks like an unknown bug.
Can you post the contents of ".hoodie" with the last-modified times intact (ls -ltr)? Also, when you triggered the rollback via the CLI, what was the entire command you passed?
I see we have an option --rollbackUsingMarkers. Did you set it or not?
We also made a fix for rolling back a completed instant: https://github.com/apache/hudi/pull/6313. Can you try 0.12.1, maybe?
If multiple writers write in parallel to different partitions of the same table, does Hudi have any plans to optimize this? For example, one Spark SQL job that is split into multiple applications for writing.
I guess not, because until committing, Hudi has no idea whether two (or more) commits conflict, so the rollback plan should execute from the latest instant to the oldest to ensure the validity of the data set.
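To make the ordering concrete, here is a minimal sketch of the "latest to oldest" rule described above. It is not Hudi's implementation: the function name `rollback_order` is hypothetical, and it assumes instant times are fixed-width timestamp strings that sort lexicographically in time order. Only the two timestamps taken from the error messages in this thread are real; the others are made up for illustration.

```python
def rollback_order(instant_times, target):
    """Return the instants to roll back, newest first, ending with `target`.

    Assumption: instant times are fixed-width digit strings, so plain string
    comparison matches chronological order.
    """
    if target not in instant_times:
        raise ValueError(f"instant {target} is not on the timeline")
    # Everything newer than the target has to go first, newest to oldest.
    newer = sorted((t for t in instant_times if t > target), reverse=True)
    return newer + [target]


timeline = [
    "20230220101500000",  # hypothetical older commit
    "20230220161017756",  # instant named in the "Found commits after time" error
    "20230221130513323",  # instant named in the rollback exception
    "20230222090000000",  # hypothetical newer commit
]

for instant in rollback_order(timeline, "20230220161017756"):
    print(instant)
```

Under these assumptions the older commit is left alone, and the target instant is rolled back only after every newer instant has been rolled back.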
Hi, our streaming job is failing with "Found commits after time :20230220161017756, please rollback greater commits first".
We tried to roll back commits with the command commit rollback --commit commit_num, but we are facing the exception below:
Caused by: java.lang.IllegalArgumentException: Cannot use marker based rollback strategy on completed instant:[20230221130513323__deltacommit__COMPLETED]
at org.apache.hudi.common.util.ValidationUtils.checkArgument(ValidationUtils.java:40)
at org.apache.hudi.table.action.rollback.BaseRollbackActionExecutor.<init>(BaseRollbackActionExecutor.java:93)
at org.apache.hudi.table.action.rollback.BaseRollbackActionExecutor.<init>(BaseRollbackActionExecutor.java:73)
at org.apache.hudi.table.action.rollback.MergeOnReadRollbackActionExecutor.<init>(MergeOnReadRollbackActionExecutor.java:48)
at org.apache.hudi.table.HoodieSparkMergeOnReadTable.rollback(HoodieSparkMergeOnReadTable.java:170)
at org.apache.hudi.client.BaseHoodieWriteClient.rollback(BaseHoodieWriteClient.java:766)
... 15 more
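The exception message suggests a precondition along the following lines. This is only a sketch inferred from the message text, not Hudi's actual code: the `Instant` class, the `choose_rollback_strategy` function, the strategy labels, and the double-underscore instant formatting are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Instant:
    """Hypothetical stand-in for a timeline instant."""
    timestamp: str
    action: str
    state: str  # assumed values: "REQUESTED", "INFLIGHT", "COMPLETED"

    def __str__(self):
        # Assumed formatting, chosen to resemble the instant in the error.
        return f"[{self.timestamp}__{self.action}__{self.state}]"


def choose_rollback_strategy(instant, use_markers):
    """Reject marker-based rollback for completed instants (assumed rule)."""
    if use_markers and instant.state == "COMPLETED":
        raise ValueError(
            "Cannot use marker based rollback strategy on completed instant:"
            + str(instant)
        )
    return "marker-based" if use_markers else "listing-based"


completed = Instant("20230221130513323", "deltacommit", "COMPLETED")

# Without markers the completed instant passes the check.
print(choose_rollback_strategy(completed, use_markers=False))

# With markers the same instant is rejected, as in the reported exception.
try:
    choose_rollback_strategy(completed, use_markers=True)
except ValueError as err:
    print(err)
```

If the real check behaves like this, rolling back a completed instant would require the marker-based option to be off, which is why the question about --rollbackUsingMarkers matters.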
Our Hudi configs are
Please help us understand what caused this issue and how to resolve it.
Environment Description AWS
Hudi version : 0.11.1
Spark version : 3.1.2
Hive version : NA
Hadoop version :3
Storage (HDFS/S3/GCS..) :S3
Running on Docker? (yes/no) :no