sac-pytorch Search Results

openai/spinningup #304

Pytorch SAC alpha sign

Hi, In line 241 of [sac.py ](https://github.com/openai/spinningup/blame/038665d62d569055401d91856abb287263096178/spinup/algos/pytorch/sac/sac.py#L215) ` loss_pi = (alpha * logp_pi - q…

jose-alatorre-harvard updated 3 years ago

openai/spinningup #329

serious BUG in sac pytorch implementation

SAC algorithm in PyTorch implementation has a serious bug `q_params = itertools.chain(ac.q1.parameters(), ac.q2.parameters())` `itertools.chain` will become empty after the first iteration, so e…

zlw21gxy updated 2 years ago

sweetice/Deep-reinforcement-learning-with-pytorch #38

SAC_Bug

in sac.py s = torch.tensor([t.s for t in self.replay_buffer]).float().to(device) Traceback (most recent call last): File "D:\PycharmProject\Deep-reinforcement-learning-with-pytorch-master\Char09 …

aut6620 updated 2 years ago

Xingyu-Lin/mbpo_pytorch #8

Very slow runtime caused by `torch.autograd.set_detect_anoma…

I found this line that causes the extreme slowdown in runtime (thousands times slower). https://github.com/Xingyu-Lin/mbpo_pytorch/blob/fe3c78c474d188c16a026051b92f8a2e84fa9387/sac/sac.py#L11 Set …

mickelliu updated 11 months ago

openai/spinningup #279

SAC log_prob computation and Q Loss

**SAC Log Prob**: I am really confused about the log_prob equation used in the pytorch code: https://github.com/openai/spinningup/blob/master/spinup/algos/pytorch/sac/core.py#L60 I realize that t…

rojas70 updated 3 years ago

pytorch/torchtitan #439

A bug related to Torch version

Hi, I am using torch 2.5.0.dev20240617+cu121 and I have the following error unsolved. ``` [rank0]:[rank0]: File "...../torchtitan/parallelisms/parallelize_llama.py", line 50, in checkpoint_wrapp…

zyushun updated 1 week ago

ikostrikov/walk_in_the_park #3

Question about the paper/implementation

Hello, thanks for sharing and open sourcing the work. After a quick read of the paper, I had several questions: - did you do an ablation of UTD? in my experiments, UTD=10 may already be enough (at …

araffin updated 1 week ago

pytorch/rl #2199

[BUG] Numerical Instability issues with `torchrl.modules.Tan…

## Describe the bug When training on `PettingZoo/MultiWalker-v9` with `Multi-Agent Soft Actor-Critic`, **all** losses (`loss_actor`, `loss_qvalue`, `loss_alpha`) explode after ~1M environment steps…

N00bcak updated 1 month ago

openai/spinningup #302

Hi, I have read the doc and config. But it seems that no flag to set the use_gpu. I have test the sac_pytorch and it seems the gpu is not used. The python is 3.6 and pytorch is 1.3.1 Could you please…

ChenyangRan updated 3 years ago

jendelel/rhl-algs #2

reproducibility of sac for halfcheetah

Hi, thanks for releasing sac code. I was wondering if you could reproduce the results of sac for HalfCheetah-v2 (10,000 around 1M) I used code from this github too https://github.com/pranz24/pyto…

tldoan updated 5 years ago

383 results
for sac-pytorch