instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Apache License 2.0
737 stars 90 forks source link

feat: refactor evaluator #1084

Closed sash-a closed 4 months ago

sash-a commented 4 months ago

What?

Refactor the evaluator to reduce code duplication and reduce memory usage.

Why?

The evaluator was one of the oldest functions in mava and it needed a bit of a rewrite as can be seen in the issues #996 and #1001. It was getting a bit complicated to extend and if you set the number of eval episodes too high it would try to create too many parallel environments.

How?

Closes: #996 and #1001 Also this achieves the goal #1071 so we can close it if this is merged