Alluxio Setup:
Version: Local build from latest master branch, 2.5.0-SNAPSHOT
Cluster mode: Single master, with Alluxio workers colocated with Presto worker nodes
Describe the bug
When the Alluxio master is restarted, it fails to recognize the remove_table journal entry during journal replay and gets stuck there forever. Here's the log segment:
To Reproduce
Restart the Alluxio master after a table delete operation. (This happened while syncing with Hive, after some Hive tables had been dropped.)
Expected behavior
TableMaster should recognize and handle the remove_table entry, and journal replay should finish successfully.
Urgency
Blocker: otherwise, restarting the Alluxio master crashes the cluster and forces a journal format.