cathywu / rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Spatial discounting #13

Open cathywu opened 7 years ago

cathywu commented 7 years ago

Testing done:

python3 examples/cluster_multiagent_point_comparison.py

Here's a small indication that the tensor restructuring and reshaping haven't totally screwed things up: learning is still happening.

[image: 2017-05-08-multiagentsharedenv-perhapsimplementationisok]

cathywu commented 7 years ago

Summary: cluster-multiagent-shared-v1 (commit cf52512)

cathywu commented 7 years ago

cluster-multiagent-shared-v2 (commit a7e2bce)

Example visualization of the LIDAR view from agent 0: the initial state from 2 concurrently sampled environments with 50 agents and 10 angular slices of the space. Red is agent 0, blue marks the other agents, and purple crosses are the lidar measurements projected back into Cartesian coordinates.

[image: lidar-reset--3335646985623688744-agent0]
[image: lidar-reset-592221439877259022-agent0]
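For readers unfamiliar with this kind of observation, here is a minimal sketch of how a sliced lidar view and its projection back to Cartesian coordinates could be computed. It is not the code from commit a7e2bce. The function names `lidar_scan` and `lidar_to_cartesian` are hypothetical, and the sketch assumes that each slice reports the distance to the nearest other agent and that a measurement is projected along the center angle of its slice.

```python
import numpy as np


def lidar_scan(positions, agent=0, n_slices=10):
    """Distance from `agent` to the nearest other agent in each angular slice.

    positions: (n_agents, 2) array of x, y coordinates.
    Slices with no agent in them are reported as inf (an assumption).
    """
    # Positions of all other agents relative to the observing agent.
    rel = np.delete(positions, agent, axis=0) - positions[agent]
    dist = np.hypot(rel[:, 0], rel[:, 1])
    # Map angles into [0, 2*pi) and bin them into equal slices.
    angle = np.mod(np.arctan2(rel[:, 1], rel[:, 0]), 2 * np.pi)
    idx = (angle / (2 * np.pi) * n_slices).astype(int)
    idx = np.minimum(idx, n_slices - 1)  # guard against rounding up to n_slices
    scan = np.full(n_slices, np.inf)
    # Unbuffered in-place minimum: keeps the closest agent per slice.
    np.minimum.at(scan, idx, dist)
    return scan


def lidar_to_cartesian(scan, origin):
    """Project finite lidar readings back to x, y points (the purple crosses)."""
    n_slices = len(scan)
    centers = (np.arange(n_slices) + 0.5) * 2 * np.pi / n_slices
    hit = np.isfinite(scan)  # skip empty slices
    directions = np.stack([np.cos(centers[hit]), np.sin(centers[hit])], axis=1)
    return origin + scan[hit][:, None] * directions
```

Because each point is drawn at the center of its slice rather than at the true bearing, the projected crosses would generally sit near, but not exactly on, the other agents under these assumptions.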

cathywu commented 7 years ago

Update: cluster-multiagent-shared-v2 (commit a7e2bce)

After clipping actions, movements between steps seem more reasonable.

[image: lidar-reset--3597117061730238138-agent0]
[image: lidar-7851020778695200524-agent0]
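As an illustration of what clipping buys here, the sketch below bounds each action before it is applied. It is an assumption-laden example, not the change in commit a7e2bce: the name `step_positions`, the bound `max_step`, and the idea that an action is a per-step 2D displacement are all hypothetical.

```python
import numpy as np


def step_positions(positions, actions, max_step=0.1):
    """Apply per-agent displacements after clipping each component.

    positions, actions: (n_agents, 2) arrays.
    max_step: hypothetical bound on the displacement per axis per step.
    """
    # Without clipping, an unbounded policy output can move an agent
    # arbitrarily far in a single step.
    clipped = np.clip(actions, -max_step, max_step)
    return positions + clipped
```

With a bound like this, no agent can move more than `max_step` along either axis in one step, which would account for smoother-looking transitions between consecutive frames.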