laermannjan / nip-deeprl-project

Student project in deep reinforcement learning with the OpenAI Gym. We evaluated and analyzed how different model architectures performed as agents in various games.
0 stars 0 forks source link

Change how timestep limit gets set. #6

Closed laermannjan closed 7 years ago

laermannjan commented 7 years ago

As until now we naively end iteration after config.max_timesteps, which breaks wrapping with gym.wrappers.Monitor. According to OpenAI Devs we should rather change it like this

laermannjan commented 7 years ago

Fixed in 727a3b9