This PR fixes the data downloading issue in the example code.
Reproduce: On a cluster with multiple GPUs per worker node, with spark.resources.tasks.gpu.amount set to 1, running the original example will trigger an error related to data downloading.
Cause: There will be multiple tasks running on the same worker and each task will try to write the data to the same path, which will corrupt the data.
This PR fixes the data downloading issue in the example code.
Reproduce: On a cluster with multiple GPUs per worker node, with spark.resources.tasks.gpu.amount set to 1, running the original example will trigger an error related to data downloading.
Cause: There will be multiple tasks running on the same worker and each task will try to write the data to the same path, which will corrupt the data.
Fix: Randomize the file path.