Student project in deep reinforcement learning with the OpenAI Gym. We evaluated and analyzed how different model architectures performed as agents in various games.
0
stars
0
forks
source link
Stop training on max_episodes rather than max_steps #15
This way we could have have equally long logs of all agents (i.e. number of episode rewards, lengths, etc) and could overlay such a plot with another plot showing the total number of steps taken at each episode to give an idea how fast episode length is declining.
This way we could have have equally long logs of all agents (i.e. number of episode rewards, lengths, etc) and could overlay such a plot with another plot showing the total number of steps taken at each episode to give an idea how fast episode length is declining.