instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Apache License 2.0
740 stars 91 forks source link

[FEATURE]: MAD4PG Quantile Regression #422

Closed DriesSmit closed 2 years ago

DriesSmit commented 2 years ago

Please describe the purpose of the feature. Is it related to a problem?

To improve MAD4PG's performance while also reducing its hyperparameters which reduces possible user error.

Describe the solution you'd like

Implement Quantile Regression for MAD4PG instead of fixing the distributional value head's range manually. Paper: https://arxiv.org/pdf/1710.10044.pdf

Describe alternatives you've considered

A clear and concise description of any alternative solutions or features you've considered.

How do we know when implementation of this feature is complete?

Checklist:

DriesSmit commented 2 years ago

Closing all TF issues as we are depreciating our TF systems.