dfridovi / rl

A homebrewed C++ library for reinforcement learning.
https://dfridovi.github.io/rl
Other
0 stars 0 forks source link

Random downsampling of the experience replay unit #5

Open dfridovi opened 7 years ago

dfridovi commented 7 years ago

Try randomly downsampling past experience before collecting each set of new rollouts. This way random samples of past experience are more likely to be new.