This repository is the implementation of Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification in ICML 2022. This codebase is based on the open-source maddpg-pytorch framework, and please refer to that repo for more documentation.
If you used this code in your research or found it helpful, please consider citing our paper:
@inproceedings{pan2021regularized,
title={Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification},
author={Pan, Ling and Huang, Longbo and Ma, Tengyu and Xu, Huazhe},
booktitle={International Conference on Machine
Learning},
year={2022}
}
pip install -e .
Datasets for different tasks are available at the following links. Please download the datasets and decompress them to the datasets folder.
Note: The datasets are too large, and the Baidu (Chinese) online disk requires a password for accessing it. Please just enter the password in the input box and click the blue button. The dataset can then be downloaded by cliking the "download" button (the second white button).
Please follow the instructions below to replicate the results in the paper.
pythonmain.py --env_id <ENVIRONMENT_NAME> --data_type <DATA_TYPE> --seed <SEED> --omar 1