instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Apache License 2.0
709 stars 83 forks source link

feat: add dones to episode metrics #1018

Closed sash-a closed 7 months ago

sash-a commented 7 months ago

What?

Improves the RecordEpisodeMetrics wrapper by adding dones, which allows us to pull out the final reward from the episode. Thus instead of logging the mean of many final episode returns we now log the mean of just the last episode return.

Why?

More accurate representation of system performance.