openai / maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
https://arxiv.org/pdf/1706.02275.pdf
MIT License
1.59k stars, 484 forks

reward is too large #51

Open Sherry-97 opened 3 years ago

Sherry-97 commented 3 years ago

When I ran python train.py --scenario simple, the reward values were too large, even though I didn't change the code.