Open MkuuWaUjinga opened 3 years ago
Thanks for the pointers. Addressed everything in the latest commits. I assume GridWorld and PointEnv don't have any seeds at all? Furthermore, with the implementation right now every worker has the same environment seed. This means that each worker always samples the same trajectory given a fixed action sequence. I think this is something we need to fix before merging?
Extend the
Environment
API to support setting environment library specific seeds.Tasks:
Environment
interfaceSampler
classes start workingOpen Questions: