-
Hi,
In line 241 of [sac.py](https://github.com/openai/spinningup/blame/038665d62d569055401d91856abb287263096178/spinup/algos/pytorch/sac/sac.py#L215)
`
loss_pi = (alpha * logp_pi - q…
-
The PyTorch implementation of the SAC algorithm has a serious bug
`q_params = itertools.chain(ac.q1.parameters(), ac.q2.parameters())`
The `itertools.chain` iterator is exhausted after the first iteration, so e…
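
The exhaustion behaviour described above can be reproduced with plain Python. This is a minimal sketch, not the repository's code: the two lists are hypothetical stand-ins for `ac.q1.parameters()` and `ac.q2.parameters()`, and materialising the chain into a list is one possible workaround, assumed here for illustration.

```python
import itertools

# Hypothetical stand-ins for ac.q1.parameters() and ac.q2.parameters().
q1_params = ["q1.weight", "q1.bias"]
q2_params = ["q2.weight", "q2.bias"]

# itertools.chain returns a one-shot iterator.
q_params = itertools.chain(q1_params, q2_params)
first_pass = [p for p in q_params]   # yields all four items
second_pass = [p for p in q_params]  # yields nothing: the iterator is consumed

# Materialising the chain once gives a sequence that can be looped over repeatedly.
q_params_list = list(itertools.chain(q1_params, q2_params))
first_list_pass = [p for p in q_params_list]
second_list_pass = [p for p in q_params_list]

print(len(first_pass), len(second_pass), len(second_list_pass))
```

Any later loop over the original `q_params` object silently does nothing, which is why such a bug raises no error.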
-
in sac.py
s = torch.tensor([t.s for t in self.replay_buffer]).float().to(device)
Traceback (most recent call last):
File "D:\PycharmProject\Deep-reinforcement-learning-with-pytorch-master\Char09 …
-
I found the line that causes an extreme slowdown in runtime (thousands of times slower):
https://github.com/Xingyu-Lin/mbpo_pytorch/blob/fe3c78c474d188c16a026051b92f8a2e84fa9387/sac/sac.py#L11
Set …
-
**SAC Log Prob**:
I am really confused about the log_prob equation used in the PyTorch code:
https://github.com/openai/spinningup/blob/master/spinup/algos/pytorch/sac/core.py#L60
I realize that t…
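
The question above is truncated, so the following is only a sketch under the assumption that it concerns the tanh-squashing correction to the Gaussian log-probability. It checks the identity log(1 - tanh(u)^2) = 2(log 2 - u - softplus(-2u)), which is commonly used because the right-hand side stays finite for large |u|. The function names are hypothetical and only the standard library is used.

```python
import math

def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def log_one_minus_tanh_sq_naive(u):
    # Direct form; fails once tanh(u) rounds to exactly 1.0.
    return math.log(1.0 - math.tanh(u) ** 2)

def log_one_minus_tanh_sq_stable(u):
    # Algebraically equivalent form that avoids log(0) for large |u|.
    return 2.0 * (math.log(2.0) - u - softplus(-2.0 * u))

for u in (-2.0, -0.5, 0.0, 0.5, 2.0):
    diff = abs(log_one_minus_tanh_sq_naive(u) - log_one_minus_tanh_sq_stable(u))
    print(u, diff)
```

For a squashed action a = tanh(u), this term is subtracted from the Gaussian log-probability of u, summed over action dimensions.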
-
Hi,
I am using torch 2.5.0.dev20240617+cu121 and I have the following unresolved error.
```
[rank0]:[rank0]: File "...../torchtitan/parallelisms/parallelize_llama.py", line 50, in checkpoint_wrapp…
-
Hello,
Thanks for sharing and open-sourcing the work.
After a quick read of the paper, I had several questions:
- Did you do an ablation of UTD? In my experiments, UTD=10 may already be enough (at …
-
## Describe the bug
When training on `PettingZoo/MultiWalker-v9` with `Multi-Agent Soft Actor-Critic`, **all** losses (`loss_actor`, `loss_qvalue`, `loss_alpha`) explode after ~1M environment steps…
-
Hi, I have read the docs and the config, but there seems to be no flag to set use_gpu. I have tested sac_pytorch and it seems the GPU is not used. The Python version is 3.6 and PyTorch is 1.3.1.
Could you please…
-
Hi,
Thanks for releasing the SAC code.
I was wondering whether you could reproduce the SAC results for HalfCheetah-v2 (around 10,000 at 1M).
I used code from this github too https://github.com/pranz24/pyto…