moabitcoin / cherry-pytorch

Reinforcement Learning Tutorials & other bedtime stories in PyTorch
MIT License
11 stars 1 forks source link

Optimize replay buffer #7

Closed sandhawalia closed 4 years ago

sandhawalia commented 4 years ago

Currently we keep 2 * 4 frame states in buffer this can be reduced to 5 with first 4 frames the current state and last 4 the next state

sandhawalia commented 4 years ago

Addressed https://github.com/moabitcoin/cherry-pytorch/pull/9