steveKapturowski / tensorflow-rl

Implementations of deep RL papers and random experimentation
Apache License 2.0
177 stars 47 forks source link

PseudoCountQLearner #23

Open xfdywy opened 6 years ago

xfdywy commented 6 years ago

In CTS-DQN, why we update the CTS model by using the next frame but not the same frame as used by action selection? refer to https://github.com/steveKapturowski/tensorflow-rl/blob/master/algorithms/intrinsic_motivation_actor_learner.py#L417