Open tgangwani opened 7 years ago
I am trying to get a sense of the time the implementation takes to achieves a good average score for Pong. The DeepMind paper gets to an average of +20 in about 4 hours of training. How does this implementation fair?
I am trying to get a sense of the time the implementation takes to achieves a good average score for Pong. The DeepMind paper gets to an average of +20 in about 4 hours of training. How does this implementation fair?