Dqn-per does not use importance sampling weight in training。

rlcode / reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

MIT License

3.35k stars 725 forks source link

Open xunyiljg opened 5 years ago

xunyiljg commented 5 years ago

Dqn_per has no important sampling weight in training, which should be a problem?