rlcode / reinforcement-learning

Minimal and Clean Reinforcement Learning Examples
MIT License
3.35k stars 725 forks

Dqn-per does not use importance sampling weights in training #88

Open xunyiljg opened 5 years ago

xunyiljg commented 5 years ago

Dqn_per does not apply importance sampling weights during training. Shouldn't that be a problem?
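
For context, here is a minimal sketch of the correction the issue refers to, as described in the Prioritized Experience Replay paper (Schaul et al.): each sampled transition i gets a weight w_i = (N * P(i))^(-beta), the weights are divided by their maximum, and each squared TD error is scaled by its weight. This is not the repository's code. The names `importance_sampling_weights` and `weighted_mse` are hypothetical, and normalizing by the maximum within the sampled batch (rather than over the whole buffer) is an assumption made to keep the example small.

```python
def importance_sampling_weights(sample_priorities, total_priority, buffer_size, beta):
    """Return max-normalized importance sampling weights for a sampled batch.

    sample_priorities: priorities p_i of the sampled transitions
    total_priority:    sum of priorities over the whole replay buffer
    buffer_size:       number of transitions N currently stored
    beta:              correction strength in [0, 1]; 1 means full correction
    """
    weights = []
    for p in sample_priorities:
        prob = p / total_priority                  # P(i), sampling probability
        weights.append((buffer_size * prob) ** (-beta))
    max_w = max(weights)                           # assumption: batch-level max
    return [w / max_w for w in weights]


def weighted_mse(td_errors, weights):
    """Mean of the squared TD errors, each scaled by its weight."""
    return sum(w * e * e for w, e in zip(weights, td_errors)) / len(td_errors)


# Two stored transitions with priorities 1 and 3; both are sampled.
w = importance_sampling_weights([1.0, 3.0], total_priority=4.0, buffer_size=2, beta=1.0)
loss = weighted_mse([1.0, 1.0], w)
```

Transitions sampled more often (higher priority) receive smaller weights, which offsets the bias that non-uniform sampling introduces into the expected update. Without these weights, the loss is a plain mean over a batch drawn from a skewed distribution.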