Student project in deep reinforcement learning with the OpenAI Gym. We evaluated and analyzed how different model architectures performed as agents in various games.
When we stop training after a number of steps or episodes and save the logs and models we should be able to resume training the same agent given all these infos (DQNs are certainly capable, why aren't we?)
When we stop training after a number of steps or episodes and save the logs and models we should be able to resume training the same agent given all these infos (DQNs are certainly capable, why aren't we?)