TransfQMix release - Githubissues

FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX

Apache License 2.0

414 stars 72 forks source link

TransfQMix release #58

Closed mttga closed 8 months ago

mttga commented 8 months ago

Here are the results average across 4 seeds:

test won: https://api.wandb.ai/links/mttga/2beham1m
test returns: https://api.wandb.ai/links/mttga/6yszv25o

Results are better than QMix for most of the maps with except of 3s5z and 3s5z_vs_3s_6z. But the main advantage of transformers is the potential transferability of the agent parameters and the learned qmix function between scenarios.

amacrutherford commented 8 months ago

Here are the results average across 4 seeds:
* test won: https://api.wandb.ai/links/mttga/2beham1m

* test returns: https://api.wandb.ai/links/mttga/6yszv25o
Results are better than QMix for most of the maps with except of 3s5z and 3s5z_vs_3s_6z. But the main advantage of transformers is the potential transferability of the agent parameters and the learned qmix function between scenarios.

awesome!