Closed laszukdawid closed 2 years ago
Example lunar_lander_ppo_multi.py doesn't seem to converge and its results are sub-optimal.
Expected to have super-duper performance. The more agents the better everything, right?
Problem
Example lunar_lander_ppo_multi.py doesn't seem to converge and its results are sub-optimal.
Expected
Expected to have super-duper performance. The more agents the better everything, right?