fix: initialize max_episode_return to negative infinity instead of zero for each algorithm

instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Apache License 2.0

709 stars 83 forks source link

Closed SimonDuToit closed 7 months ago

CLAassistant commented 7 months ago

All committers have signed the CLA.

SimonDuToit commented 7 months ago

I've removed the float conversions and added it to the advanced usage file.