Open gaoshihang opened 1 year ago
Hi @gaoshihang, thanks for reporting this. Are you able to identify which clean instant causes the OOM exception? How large are the <instant_time>.clean files under the .hoodie/ folder? I'm wondering whether leveraging Spark to deserialize the clean metadata would help here.
Thanks! I will add some logging and run some tests to identify which clean instant causes this exception and how large it is.
@gaoshihang Did you get a chance to work on the above? Are you still facing this issue?
@gaoshihang Gentle ping.
I use hudi-cli (version 0.11.1) to run the cleans show command, and I get an OOM exception:
Then I checked the code in CleansCommand.java and found that when I run cleans show, it first fetches all the clean instants and deserializes their Avro metadata, which causes the OOM.
Can we do some optimization here?
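
To make the request concrete, here is a minimal sketch of one possible direction, under assumptions: deserialize one clean instant at a time, keep only the few fields a listing needs, and stop after a limit. Every name below (CleanSummarySketch, CleanMetadata, Row, summarize, the loader function) is hypothetical and is not a Hudi API. It also assumes the memory pressure comes from retaining many deserialized objects, which has not been confirmed in this thread; if a single .clean file is very large, this would not help.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

public class CleanSummarySketch {

    /** Hypothetical stand-in for a fully deserialized clean metadata object. */
    static final class CleanMetadata {
        final String instantTime;
        final List<String> deletedFiles; // potentially very large

        CleanMetadata(String instantTime, List<String> deletedFiles) {
            this.instantTime = instantTime;
            this.deletedFiles = deletedFiles;
        }
    }

    /** Hypothetical compact row holding only what a listing would print. */
    static final class Row {
        final String instantTime;
        final int totalFilesDeleted;

        Row(String instantTime, int totalFilesDeleted) {
            this.instantTime = instantTime;
            this.totalFilesDeleted = totalFilesDeleted;
        }
    }

    /**
     * Loads at most `limit` instants, one at a time. Each full metadata
     * object becomes unreachable as soon as its summary row is built.
     */
    static List<Row> summarize(List<String> instantTimes,
                               Function<String, CleanMetadata> loader,
                               int limit) {
        List<Row> rows = new ArrayList<>();
        for (String instantTime : instantTimes) {
            if (rows.size() >= limit) {
                break; // skip deserializing instants that will not be shown
            }
            CleanMetadata metadata = loader.apply(instantTime);
            rows.add(new Row(metadata.instantTime, metadata.deletedFiles.size()));
        }
        return rows;
    }

    public static void main(String[] args) {
        // Fake loader for illustration; a real one would read and decode a file.
        Function<String, CleanMetadata> fakeLoader =
            t -> new CleanMetadata(t, Arrays.asList("file-a", "file-b"));

        List<Row> rows = summarize(Arrays.asList("003", "002", "001"), fakeLoader, 2);
        for (Row row : rows) {
            System.out.println(row.instantTime + " deleted=" + row.totalFilesDeleted);
        }
    }
}
```

The limit is the part that matters most for a CLI listing: instants past the limit are never read or deserialized at all.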