Weiyuhong-1998 closed this issue 2 years ago
Yes. In the multi-agent MuJoCo environment, all agents can see the global information, so the task is an MDP, not a POMDP. We use the same setting for the other algorithms and try to figure out the cooperative relations among agents. We use the SMAC environment to verify that our algorithm also works well in the POMDP setting. If you want to see the performance of our algorithm in multi-agent MuJoCo under a POMDP setting, you can modify the function get_obs(). I hope this answer helps.
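For anyone who wants to try the POMDP variant, here is a rough sketch of what modifying get_obs() could look like. It is not the repository's code: `ToyMultiAgentEnv`, `agent_obs_indices`, and `get_obs_fully_observable()` are hypothetical names made up for illustration, and the split of the global vector across agents is an assumption you would need to adapt to the actual robot partitioning.

```python
from typing import Dict, List, Sequence


class ToyMultiAgentEnv:
    """Hypothetical stand-in for a multi-agent MuJoCo wrapper (not the repo's class)."""

    def __init__(self, global_obs: Sequence[float],
                 agent_obs_indices: Dict[int, List[int]]):
        self._global_obs = list(global_obs)
        # Assumed mapping: agent id -> indices of the global vector it may see.
        self.agent_obs_indices = agent_obs_indices
        self.n_agents = len(agent_obs_indices)

    def _get_obs(self) -> List[float]:
        # Plays the role of self.env._get_obs(): the full global observation.
        return list(self._global_obs)

    def get_state(self) -> List[float]:
        # Global state, e.g. for a centralized critic or mixer.
        return self._get_obs()

    def get_obs_fully_observable(self) -> List[List[float]]:
        # MDP setting described above: every agent receives the global vector.
        return [self._get_obs() for _ in range(self.n_agents)]

    def get_obs(self) -> List[List[float]]:
        # POMDP variant: each agent only receives its own slice.
        full = self._get_obs()
        return [[full[i] for i in self.agent_obs_indices[agent]]
                for agent in range(self.n_agents)]


env = ToyMultiAgentEnv([0.1, 0.2, 0.3, 0.4], {0: [0, 1], 1: [2, 3]})
local_obs = env.get_obs()      # one partial observation per agent
global_state = env.get_state() # unchanged global vector
```

With this kind of change, get_state() still returns the full vector while get_obs() returns a different, smaller vector per agent, so the two functions no longer coincide.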
I notice that in your multi-agent MuJoCo environment code, the observation methods, including get_obs() and get_state(), all call self.env._get_obs() and return the same observation. So in your code, what is the difference between get_obs() and get_state(), and how do you use global information and local information in your algorithm?