RLE-Foundation / rllte

Long-Term Evolution Project of Reinforcement Learning
https://docs.rllte.dev/
MIT License
453 stars 84 forks source link

[Question] Why is the default discount factor 0.999 for PPO and 0.99 for DQN? #40

Open roger-creus opened 8 months ago

roger-creus commented 8 months ago

❓ Question

I would think 0.99 is the most widely used default value, so I suggest using that for PPO as well.

Checklist

yuanmingqi commented 8 months ago

0.999 is used for the Procgen games.