Reinforcement-Learning-course
Advanced course on Reinforcement Learning.
Sylabus
Module 1: Introduction to Reinforcement Learning
- Overview of Reinforcement Learning and its applications
- Markov Decision Processes (MDPs) and Bellman Equations
- Q-Learning and SARSA algorithms
Module 2: Temporal-Difference methods
- TD Learning
- TD prediction
- SARSA and TD control
Module 3: Monte Carlo Methods
- First-Visit Monte Carlo and Every-Visit Monte Carlo methods
- On-Policy and Off-Policy methods
- Importance Sampling
Module 4: Function Approximation
- Introduction to function approximation for Reinforcement Learning
- Overview of Deep Reinforcement Learning
- Hands-on experience with Gymnasium environment
Module 5: Project Work and Conclusion
- Final project: students will work on a real-world Reinforcement Learning problem using the techniques and tools learned in the course
- Course conclusion and future directions in Reinforcement Learning research
References:
- "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto (2018)
- Python implementation based on the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto (2018)
- "Deep Reinforcement Learning Hands-On" by Maxim Lapan (2018)
- Code for the book "Deep Reinforcement Learning Hands-On" by Maxim Lapan (2018)
- Gymnasium environment (https://gymnasium.farama.org/)
- TensorFlow documentation (https://www.tensorflow.org/guide)
- PyTorch documentation (https://pytorch.org/docs/stable/index.html)
- AlphaGo Documentary
- Monte Carlo Tree Search Another Introduction
- Stable Baselines
- Application to cartpole. Use of vectorized environments.
- Trackmania bot Training an AI to learn to win at Trackmania game.
- Code to train a Trackmania bot