-
Similar to #355226 I am also getting the same Value error but while handling gradient tape.
```
Traceback (most recent call last):
File "train_model.py", line 211, in
main()
File "trai…
-
I am not able to run training using tf.distribute.Strategy
However, it works fine without distribution.
Below is the code block for training loop
```
from __future__ import absolute_import
from…
-
Hello! Could you provide the hyperparameters and the number of training steps of each MuJoCo env to reproduce the Table5 results?(Appendix D.2 in the original paper)
I've tried the `iq_learn/scripts/…
-
Hey Brax Community,
I am not an expert on SAC, but the `alpha` update within the SAC implementation looks a bit weird to me. I might also be missing some mathematical identity or jax trick that resol…
-
# Description
This story is about doing research over Soft Actor-Critic and documenting it on the wiki.
# Why do we need to do this?
This story is needed for getting the know how Soft Actor-C…
-
elegantRL_helloworld
![image](https://user-images.githubusercontent.com/8056302/156368347-2fe4cc3f-5d95-4538-8acb-3a475dcffb03.png)
elegantRL
![image](https://user-images.githubusercontent.com/80…
-
-
-
Question1:
From [here ](https://github.com/rail-berkeley/softlearning/blob/a01fd32b9ef80d875543a49673b7c2b0be516c74/softlearning/algorithms/sac.py#L37)`heuristic_target_entropy`, I see the initializa…
-
In `train_sac.py` there is an import `from sac import Network` which gives an error!
How did you install soft actor-critic (sac)?
Default installation using `pip install sac` leads to `Network` n…