openai / multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
https://arxiv.org/pdf/1706.02275.pdf
MIT License
2.27k stars 786 forks source link

NN code #86

Open abeerM opened 3 years ago

abeerM commented 3 years ago

According to the paper "our policies are parameterized by a two-layer ReLU MLP with 64 units per layer. To support discrete communication messages, we use the Gumbel-Softmax estimator [14]." However, I could not find it in the code! The policy is hardcoded (policy.py )based on the keyboard input, so what if my environment does not require input from the user

Appreciate explaining that point