Open cathywu opened 7 years ago
Testing done:
python3 examples/cluster_multiagent_point_comparison.py
Here's some small indication that the tensor restructuring and reshaping hasn't totally screwed things up: learning is still happening.
TRPOShared
, NPOShared
, SharedGaussianMLPPolicy
, and .shared_policy
attribute).cluster-multiagent-shared-v1
(see cluster_multiagent_point_comparison.py): spatial discounting (with spatial discount rates [0.5, 0.7, 0.8, 0.97, 0.99, 0.995, 1]
, various numbers of agents, batch sizes, etc.)Example visualization of LIDAR view from agent=0
, initial state from 2 concurrently sampled environments with 50 agents, 10 angular slices of the space. Red is agent0, blue are the other agents, purple crosses are the lidar measurements projected back into cartesian coordinates:
After clipping actions, movements seem more reasonable between Steps: