-
Yu-zx updated
7 months ago
-
Disclaimer: I am not completely sure if this is a bug of PFRL.
When I ran SAC, and TD3 on my university's cluster without a GPU, I observed that memory usage gradually increased and finally reached…
-
Implement `ActorCritic` such that the constructor only takes a `FunctionApproximator` (and some extra config parameters).
```python
class Func(km.FunctionApproximator):
def body(self, S):
…
-
[Soft Actor-Critic for Discrete Action Settings](https://arxiv.org/abs/1910.07207v1)
-
Hi,
thanks for releasing sac code.
I was wondering if you could reproduce the results of sac for HalfCheetah-v2 (10,000 around 1M)
I used code from this github too https://github.com/pranz24/pyto…
-
Hi @joeybose ,
I tried your commands in this file: https://github.com/joeybose/FloRL/blob/flows/pytorch-soft-actor-critic/runFlows.sh
It seemed that your programs do not learn. Do you have recor…
ghost updated
3 years ago
-
Implement automatic tuning of temperature parameter of entropy and reproduce results from [Soft Actor-Critic Algorithms and Applications](https://arxiv.org/abs/1812.05905).
-
### 🚀 Feature
Hi!
I would like to implement a recurrent soft actor-critic. Is it a sensible contribution?
### Motivation
I actually need this algorithm in my projects.
### Pitch
The sb3 e…
-
I am unable to obtain the result as reported in the paper ‘Soft Actor-Critic Algorithms and Applications ’ on the openai environment Humanoid-v2. The result is 6000 while the original paper is 8000, …
-
Hello, I need to make SacAgent work with discrete action, so try to implement GumbelSoftmax parameterization trick by re-defining the relevant classes. However, the calculation of `agent.train(experie…