In Spark Streaming Guide, it mentioned "Failed driver can be restarted from checkpoint information" and when executor failed, "Tasks and receivers restarted by Spark automatically, no config needed"
So my question is: does executor also needs this checkpoint directory for recovery?
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Hi,
I have a question on configuring checkpoint directory for spark application chart.
According to spark documentation https://spark.apache.org/docs/latest/streaming-programming-guide.html#checkpointing Spark needs distributed file system to store its checkpoint data so that in case of failure, it can recover from checkpoint directory.
In spark application helm chart I have a checkpointlocation configuration:
I created a checkpoint pvc and mount the volume for driver pod:
In Spark Streaming Guide, it mentioned "Failed driver can be restarted from checkpoint information" and when executor failed, "Tasks and receivers restarted by Spark automatically, no config needed"
So my question is: does executor also needs this checkpoint directory for recovery?
Thanks, Wenjing