Hi, just wondering if you have a reference for the exploration behaviour as implemented in the code, since it's not discussed in the paper? As far as I can tell, it's most similar to what's introduced in this paper: https://openreview.net/forum?id=OWZVD-l-ZrC, but I'm not totally sure.