Open MaxASchwarzer opened 2 years ago
Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).
:memo: Please visit https://cla.developers.google.com/ to sign.
Once you've signed (or fixed any issues), please reply here with `@googlebot I signed it!` and we'll verify it.
@googlebot I signed it!
I'd like to add support for SPR (Self-Predictive Representations) to the Atari 100k lab project. My implementation is mostly siloed to minimize changes to the existing algorithms. SPR needs two extra components, both of which are included: a custom replay buffer, in the style of model-based methods, that returns subtrajectories, and a version of noisy nets that allows noise to be toggled on and off within a function.
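For readers unfamiliar with the two components, here is a minimal NumPy sketch of the ideas only; it is not the code in this PR. The function names `sample_subtrajectories` and `noisy_linear`, their arguments, and the use of independent Gaussian noise are all assumptions made for illustration.

```python
import numpy as np


def sample_subtrajectories(observations, terminals, batch_size, length, rng):
    """Sample windows of `length` consecutive steps (hypothetical helper).

    `terminals[t]` is True when step t is the last step of an episode.
    A window may end on a terminal step but must not continue past one.
    """
    num_steps = len(observations)
    # A start is valid if no terminal occurs before the window's last step.
    valid = [s for s in range(num_steps - length + 1)
             if not terminals[s:s + length - 1].any()]
    starts = rng.choice(valid, size=batch_size)
    # Build a (batch_size, length) index grid of consecutive steps.
    idx = starts[:, None] + np.arange(length)[None, :]
    return observations[idx], starts


def noisy_linear(x, w_mu, w_sigma, b_mu, b_sigma, rng, use_noise=True):
    """Linear layer with learnable Gaussian noise (hypothetical helper).

    Passing use_noise=False gives the deterministic mean layer, so a caller
    can switch noise on and off per call.
    """
    if use_noise:
        w = w_mu + w_sigma * rng.standard_normal(w_mu.shape)
        b = b_mu + b_sigma * rng.standard_normal(b_mu.shape)
    else:
        w, b = w_mu, b_mu
    return x @ w + b


rng = np.random.default_rng(0)
obs = np.arange(10)                      # stand-in for stored frames
terms = np.zeros(10, dtype=bool)
terms[4] = True                          # an episode ends at step 4
batch, starts = sample_subtrajectories(obs, terms, batch_size=8, length=3, rng=rng)

x = np.ones((2, 3))
w_mu, w_sigma = np.ones((3, 4)), 0.5 * np.ones((3, 4))
b_mu, b_sigma = np.zeros(4), 0.5 * np.ones(4)
clean = noisy_linear(x, w_mu, w_sigma, b_mu, b_sigma, rng, use_noise=False)
noisy = noisy_linear(x, w_mu, w_sigma, b_mu, b_sigma, rng, use_noise=True)
```

Taking `use_noise` as an explicit argument is one way to let the same parameters serve both a noisy and a noise-free forward pass within a single function.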
This implementation performs a bit better than the original version: I found a median score of 0.45 over 100 seeds, compared to 0.395 for the original.
@psc-g @agarwl