fccoelho / Reinforcement-Learning-course

Advanced course on Reinforcement Learning.

GNU General Public License v3.0

29 stars 4 forks source link

readme

Reinforcement-Learning-course

Advanced course on Reinforcement Learning.

Sylabus

Module 1: Introduction to Reinforcement Learning

Overview of Reinforcement Learning and its applications
Markov Decision Processes (MDPs) and Bellman Equations
Q-Learning and SARSA algorithms

Module 2: Temporal-Difference methods

TD Learning
TD prediction
SARSA and TD control

Module 3: Monte Carlo Methods

First-Visit Monte Carlo and Every-Visit Monte Carlo methods
On-Policy and Off-Policy methods
Importance Sampling

Module 4: Function Approximation

Introduction to function approximation for Reinforcement Learning
Overview of Deep Reinforcement Learning
Hands-on experience with Gymnasium environment

Module 5: Project Work and Conclusion

Final project: students will work on a real-world Reinforcement Learning problem using the techniques and tools learned in the course
Course conclusion and future directions in Reinforcement Learning research

References:

"Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto (2018)
Python implementation based on the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto (2018)
"Deep Reinforcement Learning Hands-On" by Maxim Lapan (2018)
Code for the book "Deep Reinforcement Learning Hands-On" by Maxim Lapan (2018)
Gymnasium environment (https://gymnasium.farama.org/)
TensorFlow documentation (https://www.tensorflow.org/guide)
PyTorch documentation (https://pytorch.org/docs/stable/index.html)
AlphaGo Documentary
Monte Carlo Tree Search Another Introduction
Stable Baselines
Application to cartpole. Use of vectorized environments.
Trackmania bot Training an AI to learn to win at Trackmania game.
Code to train a Trackmania bot