Closed RPegoud closed 11 months ago
The experiment compares Q-learning and Sarsa (and possibly Expected Sarsa) on the Cliff Walking environment, to highlight the difference in behaviour between the algorithms.
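For reference, here is a minimal sketch of how such a comparison could be set up. It is not the implementation used in this repo. It assumes the 4x12 Cliff Walking layout from Sutton and Barto (reward -1 per step, -100 and a reset to the start on falling off the cliff), and the names `step`, `epsilon_greedy` and `train` as well as the hyperparameters (`alpha=0.5`, `gamma=1.0`, `epsilon=0.1`) are illustrative assumptions. The two algorithms differ only in the bootstrap value: Sarsa uses the value of the action it will actually take next, while Q-learning uses the greedy maximum.

```python
import random

ROWS, COLS = 4, 12                            # assumed textbook grid size
START, GOAL = (3, 0), (3, 11)
ACTIONS = [(-1, 0), (0, 1), (1, 0), (0, -1)]  # up, right, down, left


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    row = min(max(state[0] + ACTIONS[action][0], 0), ROWS - 1)
    col = min(max(state[1] + ACTIONS[action][1], 0), COLS - 1)
    if row == ROWS - 1 and 0 < col < COLS - 1:
        # Fell off the cliff: large penalty, back to start, episode continues.
        return START, -100, False
    return (row, col), -1, (row, col) == GOAL


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q[state]
    return values.index(max(values))


def train(algorithm, episodes=500, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Run tabular "sarsa" or "q_learning"; return (q_table, episode_returns)."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    returns = []
    for _ in range(episodes):
        state, done, total = START, False, 0
        action = epsilon_greedy(q, state, epsilon, rng)
        while not done:
            next_state, reward, done = step(state, action)
            if algorithm == "sarsa":
                # On-policy: bootstrap from the action that will be taken.
                next_action = epsilon_greedy(q, next_state, epsilon, rng)
                bootstrap = q[next_state][next_action]
            else:
                # Off-policy: bootstrap from the greedy action value.
                bootstrap = max(q[next_state])
            target = reward + (0.0 if done else gamma * bootstrap)
            q[state][action] += alpha * (target - q[state][action])
            if algorithm != "sarsa":
                # Q-learning selects its next action after the update.
                next_action = epsilon_greedy(q, next_state, epsilon, rng)
            state, action, total = next_state, next_action, total + reward
        returns.append(total)
    return q, returns


if __name__ == "__main__":
    for name in ("sarsa", "q_learning"):
        _, episode_returns = train(name)
        print(name, sum(episode_returns[-100:]) / 100)
```

In the textbook version of this experiment, Sarsa tends to learn a safer path away from the cliff edge, while Q-learning learns the shortest path along the edge and occasionally falls off because of exploration. The printed averages depend on the seed and hyperparameters, so they should be read as a rough check and not as the reference results.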
Here are the results to reproduce: