Open dfridovi opened 7 years ago
Try randomly downsampling past experience before collecting each set of new rollouts. This way random samples of past experience are more likely to be new.
Try randomly downsampling past experience before collecting each set of new rollouts. This way random samples of past experience are more likely to be new.