FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX
Apache License 2.0
393 stars 68 forks source link

Corrected Hanabi, new Dockerfile, python 3.10 and other fixes #71

Closed mttga closed 5 months ago

mttga commented 5 months ago

This merge contains the following updates:

ippo_ff_hanabi

mttga commented 5 months ago

It says 10k additions which is crazy but it's because we're adding a ground-truth file for testing hanabi which contains 10k game scores.

benellis3 commented 5 months ago

Have you tested that the PPO code runs with the new requirements for other environments? Otherwise this could break a lot of stuff.

mttga commented 5 months ago

Have you tested that the PPO code runs with the new requirements for other environments? Otherwise this could break a lot of stuff.

I've checked that all the scripts run and I collected results for both IPPO and MAPPO with rnns in mpe and smax here: https://wandb.ai/mttga/jaxmarl_pull_request_71?nw=nwusermttga

Let me know if you want to see other results

mttga commented 5 months ago

Overcooked: overcooked_cramped_room_new

benellis3 commented 5 months ago

LGTM. Thanks very much for all this hard work this is awesome 😄 . Feel free to merge when you are ready.