Open zrtvwp opened 2 years ago
@zrtvwp img2dataset supports incremental mode for non-downloaded shards on restart.
https://github.com/rom1504/img2dataset#api
incremental_mode: Can be "incremental" or "overwrite". For "incremental", img2dataset will download all the shards that were not downloaded, for "overwrite" img2dataset will delete recursively the output folder then start from zero (default incremental)
That exception message can be caused by an instance being terminated using a preemptible secondary worker. In this case, dataproc will automatically restart the instance.
The hang issue seems to be related to https://github.com/rom1504/img2dataset/issues/187, but I'm not sure how to fix it yet :(
I think it will start downloading again in incremental mode. There seems to be a problem with Spark's task scheduling. If you run the task again, it will start downloading again in incremental mode.
For the third time in a row it hangs in the same place. Sometimes it just freezes, sometimes end up flooding org.apache.spark.network.server.TransportChannelHandler errors. It also bothers me because after each restart of the Job bucket increases in size and it is not clear to me, either he downloads what was unavailable the last time, or some action repeats and downloads the same files.
log1
log3
log3