Open ugurkanates opened 3 years ago
Presentation Resources:
https://www.youtube.com/watch?v=5P7I-xPq8u8
https://www.youtube.com/watch?v=vQ_ifavFBkI
https://www.youtube.com/watch?v=wM-Sh-0GbR4
https://towardsdatascience.com/on-policy-v-s-off-policy-learning-75089916bc2f
https://openai.com/blog/openai-baselines-ppo/
https://jonathan-hui.medium.com/rl-proximal-policy-optimization-ppo-explained-77f014ec3f12
https://arxiv.org/pdf/1707.06347.pdf