laszukdawid / ai-traineree

PyTorch agents and tools for (Deep) Reinforcement Learning
http://ai-traineree.readthedocs.io/
Apache License 2.0
25 stars 6 forks source link

PPO on MultiEnvRunner doesn't work properly #13

Closed laszukdawid closed 2 years ago

laszukdawid commented 3 years ago

Problem

Example lunar_lander_ppo_multi.py doesn't seem to converge and its results are sub-optimal.

Expected

Expected to have super-duper performance. The more agents the better everything, right?