cathywu / rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
Other
1 stars 0 forks source link

Spatial discounting #14

Open cathywu opened 7 years ago

cathywu commented 7 years ago

Tracking the status.

cathywu commented 7 years ago

Summary [cluster-multiagent-shared-v4]

Seems that MultiagentSharedEnv, even without collisions is quite difficult of an environment.

Test-time rollout for k=6: video (Summary: in 1000 steps, 5 of 6 agents reach the target, although they get close within a few steps.)

cathywu commented 7 years ago

Summary [cluster-multiagent-shared-v6]

MultiagentSharedEnv, done_epsilon=0.01, batch_size=10000, k=50 (WHY does (exponential) discount factor of 0.5 cause such regression?): 2017-05-16-multiagentsharedenv-spatialdiscount-doneeps0 01-batch10000-k50

Next I will try binary and linear discounting instead of exponential.

cathywu commented 7 years ago

Summary [cluster-multiagent-shared-v7]

Results:

Pending:

MultiagentSharedEnv, done_epsilon=0.01, batch_size=10000, collision_penalty=50, k=25 (first positive result on spatial discounting!): 2017-05-16-multiagentsharedenv-binaryspatialdiscount-donesps0 01-batch10000-k25

MultiagentSharedEnv, done_epsilon=0.01, batch_size=10000, collision_penalty=50, k=50: 2017-05-16-multiagentsharedenv-binaryspatialdiscount-donesps0 05-batch10000-k25

cathywu commented 7 years ago

Summary [cluster-multiagent-shared-v9]

Note: local run, 1 seed

Results:

2017-06-15-multiagentsharedenv-assymetricreward