Closed pfrendl closed 7 years ago
The timestep parameters of the SharedAdam optimizer are not shared. This should lead to bias overcorrection, leading to incorrect unbiased estimates. Does the current implementation work?
Yes, that's true. Thanks! I will fix it in the next couple of days.
Fixed in https://github.com/ikostrikov/pytorch-a3c/commit/5d9b07d80740e26f78cb283f74b5b802906a9d83
The timestep parameters of the SharedAdam optimizer are not shared. This should lead to bias overcorrection, leading to incorrect unbiased estimates. Does the current implementation work?