raharth / PyMatch

A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms
MIT License

Implement SARSA #24

Open raharth opened 3 years ago

raharth commented 3 years ago

Implement the general SARSA algorithm as defined by Sutton and Barto.
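
For reference, a minimal tabular sketch of the on-policy update this issue asks for. It is not PyMatch code: the chain environment, the function names and the hyperparameter values are all made up for illustration, and a deep version would replace the table with a network.

```python
import random
from collections import defaultdict


def epsilon_greedy(q, state, n_actions, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = [q[(state, a)] for a in range(n_actions)]
    best = max(values)
    # Break ties randomly so the agent does not always fall back to action 0.
    return rng.choice([a for a, v in enumerate(values) if v == best])


def sarsa_update(q, s, a, r, s_next, a_next, done, alpha, gamma):
    """On-policy TD(0) update: the target uses the action actually taken next."""
    target = r if done else r + gamma * q[(s_next, a_next)]
    q[(s, a)] += alpha * (target - q[(s, a)])


def chain_step(state, action, n_states=5):
    """Hypothetical toy environment: action 1 moves right, action 0 moves left.
    Reaching the right end gives reward 1 and ends the episode."""
    next_state = min(state + 1, n_states - 1) if action == 1 else max(state - 1, 0)
    done = next_state == n_states - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Run SARSA on the toy chain and return the learned Q-table."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        s, done = 0, False
        a = epsilon_greedy(q, s, 2, epsilon, rng)
        while not done:
            s_next, r, done = chain_step(s, a)
            # Choose the next action first, then bootstrap from exactly that action.
            a_next = epsilon_greedy(q, s_next, 2, epsilon, rng)
            sarsa_update(q, s, a, r, s_next, a_next, done, alpha, gamma)
            s, a = s_next, a_next
    return q
```

The part that differs from Q-learning is the target in `sarsa_update`: it bootstraps from `q[(s_next, a_next)]` for the action the behaviour policy really selected, not from the maximum over actions.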

raharth commented 3 years ago

There is a first implementation, but it fails to converge. This could be due to a bug, but also due to bad hyperparameters.

raharth commented 3 years ago

There is a paper on it: "Deep Reinforcement Learning with Sarsa and Q-Learning: A Hybrid Approach" by Xu et al.

Their results are somewhat strange, though. Their DQN/DDQN baseline is worse than mine, even though I did little to no tuning.

Even though they implement exactly the theoretical idea, I fail to make my version converge, while theirs converges (though to a result worse than my baselines).