Problem
In the random agent script, the wandb full-episode data logging appears to skip a few steps. This is because wandb's step counter also counts the episode reward logging calls made before the full data logging.
Potential Solution
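Log an additional metric that shows the timestep and the day (proportional), so the full episode data can be read against it rather than against wandb's internal step counter.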
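
A minimal sketch of this idea, not the script's actual code. It assumes "day (proportional)" means the timestep divided by the number of steps per day. The names `time_axes`, `log_full_episode`, `steps_per_day` and the `episode/` key prefix are hypothetical. `run` is expected to be a wandb run (for example the object returned by `wandb.init`); only its `define_metric` and `log` methods are used.

```python
from typing import Any, Dict, List


def time_axes(timestep: int, steps_per_day: int) -> Dict[str, float]:
    """Build the extra metrics: raw timestep and fractional day.

    Assumption: "day (proportional)" is timestep / steps_per_day.
    """
    return {
        "episode/timestep": timestep,
        "episode/day": timestep / steps_per_day,
    }


def log_full_episode(run: Any, records: List[Dict[str, float]], steps_per_day: int) -> None:
    """Log one record per timestep, each carrying its own time axis.

    `run` is assumed to be a wandb run; `records` is a hypothetical list
    holding one dict of values per timestep of the episode.
    """
    # Tell wandb to plot every "episode/*" metric against our own timestep
    # instead of its internal step, which earlier reward logging advanced.
    run.define_metric("episode/timestep")
    run.define_metric("episode/*", step_metric="episode/timestep")

    for t, record in enumerate(records):
        payload = {f"episode/{name}": value for name, value in record.items()}
        payload.update(time_axes(t, steps_per_day))
        run.log(payload)
```

With this in place the internal step still advances on every `log` call, but the full-episode charts use `episode/timestep` (or `episode/day`, if chosen as the x-axis in the UI), so the earlier reward logging no longer shows up as skipped steps.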