Performance difference between TruncNormal and TanhNormal

danijar / dreamerv2

Mastering Atari with Discrete World Models

https://danijar.com/dreamerv2

MIT License

886 stars 195 forks source link

Performance difference between TruncNormal and TanhNormal #53

Open IcarusWizard opened 1 year ago

IcarusWizard commented 1 year ago

Hey @danijar.

I just noticed that the code is using TruncNormal as the actor distribution instead of TanhNormal as in v1. I wonder did you make some ablations on these two choices and see TruncNormal provide better results? Or the change is only because the entropy of TruncNormal is easier to compute than TanhNormal for the entropy regularizer?