In random agent script wandb full episode data logging skips a few steps

rdnfn / beobench

A toolkit providing easy and unified access to building control environments for reinforcement learning (RL).

https://beobench.readthedocs.io

MIT License

37 stars 4 forks source link

In random agent script wandb full episode data logging skips a few steps #67

Closed rdnfn closed 2 years ago

rdnfn commented 2 years ago

Problem

In random agent script wandb full episode data logging skips a few steps. This is because wandb counts the epsiode reward logging steps made prior to the full data logging.

Potential Solution

Add another metric to log that shows timestep and day (proportional).

rdnfn commented 2 years ago

This has been implemented and will be shipped with v0.4.4 🚀