Open joon0503 opened 6 years ago
The current design is the most basic architecture for deep RL. The following are some improvements that can be made for Q-learning.
Incorporate Input Cost
References:
https://github.com/Silvicek/DQN-for-autonomous-vehicles
https://medium.freecodecamp.org/improvements-in-deep-q-learning-dueling-double-dqn-prioritized-experience-replay-and-fixed-58b130cc5682
Implemented target network.
Implemented dueling network. It seems the double network is just the target network?
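On the double-vs-target question: in the standard formulations the two are related but not identical. With a target network alone, the target network both selects and evaluates the next action. Double DQN lets the online network select the action and uses the target network only to evaluate it. Below is a minimal sketch of the two target computations, assuming plain Python lists of Q-values for the next state. The function names and the example numbers are hypothetical and not taken from this repository.

```python
def dqn_target(reward, next_q_target, gamma=0.99, done=False):
    """DQN with a target network: the target net picks AND scores the next action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN: the online net picks the action, the target net scores it."""
    if done:
        return reward
    best_action = next_q_online.index(max(next_q_online))
    return reward + gamma * next_q_target[best_action]


# Hypothetical Q-values for one next state with three actions.
next_q_online = [1.0, 2.0, 0.5]   # online network prefers action 1
next_q_target = [3.0, 1.5, 0.0]   # target network prefers action 0

y_dqn = dqn_target(1.0, next_q_target, gamma=0.9)
y_ddqn = double_dqn_target(1.0, next_q_online, next_q_target, gamma=0.9)
```

When both networks agree on the best action the two targets coincide. They differ only when the networks disagree, and that is the case where Double DQN is meant to reduce overestimation.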
Implemented double network and PER (prioritized experience replay).
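For reference, here is a rough sketch of proportional prioritized sampling as it is usually described: transition i is drawn with probability p_i^alpha / sum_k p_k^alpha, and importance-sampling weights correct for the non-uniform draw. This is not the code used in this repository. The function names, the default `alpha` and `beta` values, and the flat list of priorities are all assumptions for illustration, and priorities are assumed to be strictly positive.

```python
import random


def per_probabilities(priorities, alpha=0.6):
    """Turn positive priorities into sampling probabilities (alpha=0 gives uniform)."""
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    return [s / total for s in scaled]


def per_sample(priorities, batch_size, alpha=0.6, beta=0.4, rng=random):
    """Sample indices by priority and return normalised importance-sampling weights."""
    probs = per_probabilities(priorities, alpha)
    n = len(priorities)
    indices = rng.choices(range(n), weights=probs, k=batch_size)
    # The largest possible weight belongs to the least likely transition;
    # dividing by it keeps every weight in (0, 1].
    max_weight = (n * min(probs)) ** (-beta)
    weights = [((n * probs[i]) ** (-beta)) / max_weight for i in indices]
    return indices, weights
```

A linear scan like this costs O(N) per batch. Implementations with large buffers typically use a sum tree instead, which is left out here for brevity.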
Added HER (hindsight experience replay).