Same action in multi-agent environment

Kaixhin / Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

MIT License

1.59k stars 284 forks source link

Hello, thank you for your contribution! I am a student and recently I am running a multi-agent program with your code and I am suffering from a problem. I'm using Unity3d to simulate multi-robots experiments and send the observations(a camera image and several sensors' information) to the python script. When I feed the states to the network, the output actions are the same. e.g. We have 9 agents and each agent can choose 8 different actions, these actions can be {0,1,2,3,4,5,6,7}, when we feed the state to network, the outputs are always the same action for every time step, such as {1,1,1,1,1,1,1,1,1} or {2,2,2,2,2,2,2,2,2}, etc. Do you have any idea about this kind of problem?

Kaixhin / Rainbow

Same action in multi-agent environment #32