Closed by DanielS684 3 years ago
Take a look at the encode method; there's a stop_grad when ema is set to True.
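For anyone landing here later, this is a minimal sketch of how an encode method with an ema flag can stop gradients on the key branch. It is not copied from the repo: TinyCURL, the linear encoders, and the tensor sizes are made up for illustration, and it assumes PyTorch with the stop-gradient done via torch.no_grad().

```python
import torch
import torch.nn as nn


class TinyCURL(nn.Module):
    """Toy stand-in for a CURL module (hypothetical, for illustration only)."""

    def __init__(self, obs_dim=8, z_dim=4):
        super().__init__()
        self.encoder = nn.Linear(obs_dim, z_dim)         # query encoder (trained)
        self.encoder_target = nn.Linear(obs_dim, z_dim)  # key encoder (EMA copy)
        self.encoder_target.load_state_dict(self.encoder.state_dict())

    def encode(self, x, detach=False, ema=False):
        if ema:
            # Key branch: no graph is recorded, so the output is already
            # cut off from backprop without passing detach=True.
            with torch.no_grad():
                z_out = self.encoder_target(x)
        else:
            z_out = self.encoder(x)
        if detach:
            z_out = z_out.detach()
        return z_out


curl = TinyCURL()
obs_anchor = torch.randn(2, 8)
obs_pos = torch.randn(2, 8)

z_a = curl.encode(obs_anchor)           # gradients flow into curl.encoder
z_pos = curl.encode(obs_pos, ema=True)  # no gradients into curl.encoder_target

loss = (z_a * z_pos).sum()  # placeholder loss, not the contrastive loss
loss.backward()
```

In this sketch, ema=True has the same effect on the key features as detaching them, which would make the paper's pseudocode and a call like encode(obs_pos, ema=True) consistent.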
@MishaLaskin Yeah, I just read through the code again and realized my mistake: it was using the encoder from critic_target instead. Really awesome work, though.
I noticed when reading through the paper and the code that the pseudocode in the paper says the key encoder needs to be detached from the graph, but the actual code doesn't set detach=True for
z_pos = self.CURL.encode(obs_pos, ema=True)
I wanted to know whether the paper or the code is correct, or whether I am missing some part of the computation. This is what is in the code for curl_sac.py:
and this is what is in the pseudocode for the paper: