google-deepmind / rlax

https://rlax.readthedocs.io
Apache License 2.0
1.24k stars 85 forks source link

PopArt example bug #111

Closed kinalmehta closed 1 year ago

kinalmehta commented 1 year ago

Hi,

It is mentioned in the comments here that new popart states should be used to normalize/denormalize, but in the code old states are being used.

Thanks Kinal

jqdm commented 1 year ago

Thanks for spotting that, the example is fixed now.