Open Bam4d opened 2 years ago
Is the main purpose of it to run more random seeds? This should no longer be an issue with the new slurm integration in the benchmark utility https://docs.cleanrl.dev/get-started/benchmark-utility/#slurm-integration. It basically increments the seed per run :)
env_ids={{env_ids}}
seeds={{seeds}}
env_id=${env_ids[$SLURM_ARRAY_TASK_ID / {{len_seeds}}]}
seed=${seeds[$SLURM_ARRAY_TASK_ID % {{len_seeds}}]}
echo "Running task $SLURM_ARRAY_TASK_ID with env_id: $env_id and seed: $seed"
srun {{command}} --env-id $env_id --seed $seed
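The arithmetic in the template above maps each SLURM array task ID to one (env_id, seed) pair. A minimal Python sketch of the same indexing (the `env_ids` and `seeds` values below are hypothetical, not the actual benchmark settings):

```python
# Hypothetical example values; the real lists come from the benchmark config.
env_ids = ["CartPole-v1", "Acrobot-v1"]
seeds = [1, 2, 3]

def task_to_run(task_id):
    # Mirrors the bash arithmetic: integer division picks the env,
    # the remainder picks the seed.
    env_id = env_ids[task_id // len(seeds)]
    seed = seeds[task_id % len(seeds)]
    return env_id, seed

# Tasks 0..5 cover every (env_id, seed) combination exactly once.
for t in range(len(env_ids) * len(seeds)):
    print(t, task_to_run(t))
```

Each array task therefore runs a distinct combination, so seeds are incremented per run without any two jobs sharing a configuration.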
Problem Description
In many CleanRL scripts, a timestamp is used as a differentiator in the naming of the jobs:
https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/ppo.py#L134
In some rare cases (for example, when running on a Slurm cluster or Sun Grid Engine), two scripts may be executed within a second of each other and end up with the same timestamp.
If a shared drive is used to store results (very common for these cluster setups), the jobs can overwrite each other's data. You will end up with a bunch of weirdly similar-looking runs in wandb.
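The collision can be sketched as follows. The run name is built from the env id, experiment name, seed, and an integer timestamp (the format below mirrors the style of the linked line in ppo.py; treat the exact fields as illustrative):

```python
import time

def make_run_name(env_id, exp_name, seed, timestamp):
    # Illustrative run-name format in the style of CleanRL's scripts.
    return f"{env_id}__{exp_name}__{seed}__{timestamp}"

# Two jobs launched within the same second see the same int(time.time())
# value, so their run names are identical and their output paths collide.
ts = int(time.time())
a = make_run_name("CartPole-v1", "ppo", 1, ts)
b = make_run_name("CartPole-v1", "ppo", 1, ts)
```

Because the timestamp has one-second granularity, nothing distinguishes the two runs, and whichever job writes last wins.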
Checklist
poetry install
(see CleanRL's installation guideline).
Current Behavior
Data is overwritten due to shared drives and a naming convention that causes collisions.
wandb/tensorboard might throw an error, but I've only ever seen this happen once.
Expected Behavior
Data should not be overwritten and runs should always have unique names.
Possible Solution
I have replaced time.time() with uuid.uuid4(), which is extremely unlikely to cause collisions.
Steps to Reproduce
Steps to reproduce are a bit pointless unless you have access to a fairly empty cluster; however, I believe the bug is trivial enough to understand without repro steps.
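The replacement described under Possible Solution can be sketched like this (the run-name format again mirrors the style of the linked ppo.py line; the exact fields are illustrative):

```python
import uuid

def make_run_name(env_id, exp_name, seed):
    # uuid4() is 122 bits of randomness, so two concurrently launched
    # jobs will not share a run name even if started in the same second.
    return f"{env_id}__{exp_name}__{seed}__{uuid.uuid4()}"

# Same arguments, same instant: the names still differ.
a = make_run_name("CartPole-v1", "ppo", 1)
b = make_run_name("CartPole-v1", "ppo", 1)
```

Unlike a one-second-granularity timestamp, the UUID suffix makes each run's output directory unique regardless of how many jobs the scheduler starts simultaneously.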