google-deepmind / meltingpot

A suite of test scenarios for multi-agent reinforcement learning.
Apache License 2.0
577 stars 116 forks source link

Running ACB Baseline #162

Closed AsadJeewa closed 1 year ago

AsadJeewa commented 1 year ago

How can I run the ACB baseline? self_play_train.py runs PPO and says "The ACB models used as baselines in the meltingpot paper were not run using rllib". I would like to run ACB on a modified version of cleanup. Thanks

duenez commented 1 year ago

We don't have an open source implementation of the ACB baseline, and it would be hard to open source, as it relies on significant internal tooling and infrastructure. We anticipate some async actor critic methods will be provided as part of the Melting Pot Competition. Those will not be our internal implementation, but will be good reference points for open source RL baselines on Melting Pot. So, stay tuned! :)