Hi!
I think there is a small bug in the PEARLWorker class for the PEARL algorithm.
The ´deterministic´ variable should specify if the action should be sampled from the action distribution or if the action should be the mean of the distribution, but in the current implementation the action is always sampled even if ´deterministic == True´.
Hi! I think there is a small bug in the PEARLWorker class for the PEARL algorithm. The ´deterministic´ variable should specify if the action should be sampled from the action distribution or if the action should be the mean of the distribution, but in the current implementation the action is always sampled even if ´deterministic == True´.
https://github.com/rlworkgroup/garage/blob/master/src/garage/torch/algos/pearl.py#L742-L745