Closed sash-a closed 6 months ago
Use rlax where we can, so that we don't have to implement and test common methods.
rlax
As far as I can see we can use it for policy/critic loss and GAE.
Investigated, it massively slowed down existing systems :disappointed:
Feature
Use
rlax
where we can, so that we don't have to implement and test common methods.Proposal
As far as I can see we can use it for policy/critic loss and GAE.
Benchmarking