RL
Implemented in Tensorflow 2.x
- Atari (New_Atari_DQN_image.ipynb, atari_DDDQN.ipynb are correctly implemented but not able to fully train on colab(takes lot of time to train, Will try to get GPU on GCP) )
- Mountain Car with QLearning (works)
- Train using qlearning and DeepQLearning (works)
- Dueling Double Deep Q Learning (works)
- Deep Q learning (works)
- ActorCritic (works)
- Reinforce(Policy Gradient) (works)
- A2C (works)
- Multi-Worker_Actor-Critic(A2C) (works)
- Proximal Policy Optimization (PPO) (works)
- Deep Deterministic Policy Gradients (DDPG) (works)
- Twin Delayed Deep Deterministic Policy Gradient (TD3) (works)
- Soft Actor Critic (softAC) (works, reparameterized sampling not used as of now)
NOTE:- Inside Atari_DQN_image.ipynb, implementation of preprocessing for stacked frame was not correct, so i have uploaded new files with correct implementation.
Recommended resources to learn RL:
- https://www.coursera.org/specializations/reinforcement-learning (one of the best course for RL fundamentals)
- https://mitpress.mit.edu/books/reinforcement-learning-second-edition (Book)
- https://www.youtube.com/channel/UC58v9cLitc8VaCjrcKyAbrw (best for coding)
- https://adventuresinmachinelearning.com (very useful)
- https://www.youtube.com/playlist?list=PLWzQK00nc192L7UMJyTmLXaHa3KcO0wBT (Very useful for overview of different algorithms)
Thanks to:-
- https://www.youtube.com/channel/UC58v9cLitc8VaCjrcKyAbrw
- https://www.freecodecamp.org/news/an-introduction-to-deep-q-learning-lets-play-doom-54d02d8017d8/
- https://rubikscode.net/2019/07/22/deep-convolutional-q-learning-with-python-and-tensorflow-2-0/
- https://pythonprogramming.net/training-deep-q-learning-dqn-reinforcement-learning-python-tutorial/
- https://www.packtpub.com/big-data-and-business-intelligence/hands-reinforcement-learning-python
- http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html
- https://github.com/udacity/deep-reinforcement-learning