Farama-Foundation / MicroRTS-Py

A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)
MIT License
234 stars, 45 forks

Log discounted returns #52

Closed vwxyzjn closed 2 years ago

vwxyzjn commented 2 years ago

This PR also logs discounted returns. This metric is useful because the agent actually optimizes discounted returns over chunks of experience (each chunk holds num_envs * num_steps transitions). I also log the episodic length and rename the metric from episode_reward to episodic_return, which describes it more accurately.
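For illustration, here is a minimal sketch of how the three per-episode metrics could be computed from a list of rewards. It is not the code in this PR: the function name `episode_stats`, the default `gamma` value, and the keys `discounted_episodic_return` and `episodic_length` are assumptions made for the example; only `episodic_return` is a name taken from the description above.

```python
from typing import Dict, Sequence


def episode_stats(rewards: Sequence[float], gamma: float = 0.99) -> Dict[str, float]:
    """Summarize one finished episode (hypothetical helper, not the PR's code).

    rewards: per-step rewards of a single episode, in time order.
    gamma:   discount factor (0.99 is an assumed default).
    """
    episodic_return = 0.0      # undiscounted sum of rewards
    discounted_return = 0.0    # sum of gamma**t * r_t
    discount = 1.0             # running value of gamma**t
    for r in rewards:
        episodic_return += r
        discounted_return += discount * r
        discount *= gamma
    return {
        "episodic_return": episodic_return,
        "discounted_episodic_return": discounted_return,  # assumed metric name
        "episodic_length": len(rewards),                  # assumed metric name
    }


if __name__ == "__main__":
    # Three steps with reward 1 each and gamma = 0.5.
    print(episode_stats([1.0, 1.0, 1.0], gamma=0.5))
```

With `gamma < 1` the discounted value is never larger than the undiscounted one for non-negative rewards, so plotting both side by side shows how much of the return comes from late in the episode.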