Closed nguyen-thanh05 closed 9 months ago
Also included a very crude implementation of deep Q learning. Need to reimplement the replay buffer properly, otherwise for now it is still quite unstable
Also included a very crude implementation of deep Q learning. Need to reimplement the replay buffer properly, otherwise for now it is still quite unstable