RLE-Foundation / rllte

Long-Term Evolution Project of Reinforcement Learning
https://docs.rllte.dev/
MIT License
467 stars 86 forks source link

[Question] Why is the default discount factor 0.999 for PPO and 0.99 for DQN? #40

Open roger-creus opened 1 year ago

roger-creus commented 1 year ago

❓ Question

I would think 0.99 is the most widely used default value, so I suggest using that for PPO as well.

Checklist

yuanmingqi commented 1 year ago

0.999 is used for the Procgen games.